(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property Organization 
International Bureau 




iininiiiiiiiiiiiniiii 



(43) International Publication Date (10) International Publication Number 

18 January 2001 (18.01.2001) PCT WO 01/04282 A2 



(51) International Patent Classification 7 : C12N 15/00 

(21) International Application Number PCT/USOO/1 8971 

(22) International Filing Date: 12 July 2000(12.07.2000) 

(25) Filing Language: English 

(26) Publication Language: English 



(30) Priority Data: 

09/351,778 



12 July 1999 (12.07.1999) US 



(63) Related by continuation (CON) or continuation-in-part 
(CIP) to earlier application: 

US 09/351,778 (CIP) 

Filed on 12 July 1999 (12.07.1999) 

(71) Applicant (for all designated States except US): SAINT 
LOUIS UNIVERSITY [US/US]; 221 N. Grand, Sl Louis, 
MO 63103 (US). 



(72) Inventors; and 

(75) Inventors/Applicants (for US only): WOLD, William S^ 
M. [CA/US]; 1609 Adgers Wharf Boulevard, Chesterfield, 
MO 63017 (US). TOTH, Karory [HU/US]; 7345 Fern- 
brook, Apt. 202, SL Louis, MO 63123 (US). DORONIN, 
Konstantin [RU/US]; 8133 BriaAaven Trail, Apt 304, St 
Louis, MO 63 123 (US). TOLLEFSON, Ann, E. [US/US]; 
9026 Philo Avenue, St Louis, MO 63123 (US). 

(74) Agents: GENDLOFF, Elie, H. et al.; Suite 1400, 7733 
Forsyth Blvd., St. Louis, MO 63105-1817 (US). 

(81) Designated States (national): AE, AG, AL, AM, AT, AU, 
AZ, BA, BB, BG, BR, BY, CA, CH, CN, CR, CU, CZ, DE, 
DK, DM, DZ, EE, ES, FI, GB, GD, GE, GH, GM, HR, HU, 
ID, IL, IN, IS, JP, KE, KG, KP, KR, KZ, LC, LK, LR, LS, 
LT, LU, LV, MA, MD, MG, MK, MN, MW, MX, NO, NZ, 
PL, PT, RO, RU, SD, SE, SG, SI, SK, SL, TJ, TM, TR, TT, 
TZ, UA, UG, US, UZ, VN, YU, ZA, ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM, 
KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZW), Eurasian 
patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), European 

[Continued on next page] 



= (54) Title: REPLICATION-COMPETENT ANTI-CANCER VECTORS 



E1 A Functions Major Ute Transcription Una 
• Induce Ad Qenet. 

. OeromriiteoeDcycte " u UADPL5 

. OrivttQ.into&phisa. «^^M><><><«><> 

EIA E1B E3 



< 

00 







t 


ir 


B. fP~+ 


E3 ^ 










EIA 




E4 


n. =t> tAAi — 






KDiiyy 


1 1 


1 






ir 


EIA 

0MD7 i A 2. 9 


^^ADP 






SP8-P 


kdi-spb|XX 


1 1 


II 




E2 


E4 



Q (57) Abstract: Novel vectors which are replication-competent in neoplastic cells and which overexpress an adenovirus death pro- 
^ tein are disclosed Some of the disclosed vectors are replication-restricted to neoplastic cells or to neoplastic alveolar type II cells. 
^ Compositions and methods for promoting the death of neoplastic cells using these replication-competent vectors are also disclosed. 



BEST AVAILABLE COPY 



WO 01/04282 A2 IIII1I1MI1DMIM1III 



patent (AT, BE, CH, CY, DE, DK, ES, H, FR, GB, GR, IE, 
IT, LU, MC, NL, FT, SE), OAPI patent (BF, BJ, CF, CG. 
CI, CM, GA, GN, GW, ML, MR, NE, SN, TD, TG). 

Published: 

— Without international search report and to be republished 
upon receipt of that report. 



For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations 0 appearing at the begin- 
ning of each regular issue of the PCT Gazette. 



WO 01/04282 



PCT/US00/18971 



Replication-Competent Anti-Cancer Vectors 
Reference to Government Grant 

This invention was made with government support under a grant from the National 
Institutes of Health, Grant Number ROl CA71704 and CA81829. The United States 
Government has certain rights in this invention. 
5 Background of the Invention 
(1) Field of the Invention 

This invention relates generally to the treatment of cancer and more particularly to 
vectors which replicate in neoplastic cells and which overexpress an adenovirus death protein 
(ADP) and to the use of these vectors in treating human cancer. 
1 0 (2) Description of the Related Art 

Cancer is a leading cause of death in the United States and elsewhere. Depending on 
the type of cancer, it is typically treated with surgery, chemotherapy, and/or radiation. These 
treatments often fail: surgery may not remove all the cancer; some cancers are resistant to 
chemotherapy and radiation therapy; and chemotherapy-resistant tumors frequently develop. 
1 5 New therapies are necessary, to be used alone or in combination with classical techniques. 

One potential therapy under active investigation is treating tumors with recombinant 
viral vectors expressing anti-cancer therapeutic proteins. Adenovirus-based vectors contain 
several characteristics that make them conceptually appealing for use in treating cancer, as 
well as for therapy of genetic disorders. Adenoviruses (hereinafter used interchangeably with 
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" Ads") can easily be grown in culture to high titer stocks that are stable. They have a broad 
host range, replicating in most human cancer cell types. Their genome can be manipulated by 
site-directed mutation and insertion of foreign genes expressed from foreign promoters. 

The adenovirion consists of a DNA-protein core within a protein capsid (reviewed by 
5 Stewart et al., "Adenovirus structure by x-ray crystallography and electron microscopy. 0 in: 
The Molecular Repertoire of Adenoviruses, Doerfler, W. et al., (ed)., Springer-Verlag, 
Heidelberg, Germany, p. 25-38). Virions bind to a specific cellular receptor, are endocytosed, 
and the genome is extruded from endosomes and transported to the nucleus. The genome is a 
linear duplex DNA of about 36 kbp, encoding about 36 genes (Fig. 1 A). In the nucleus, the 

1 0 "immediate early" El A proteins are expressed initially, and these proteins induce expression 
of the "delayed early** proteins encoded by the E1B, E2, E3, and E4 transcription units 
(reviewed by Shenk, T. "Adenoviridae: the viruses and their replication" in: Fields Virology, 
Field, B .N. et al., Lippencott-Raven, Philadelphia, p. 211 1-2148). El A proteins also induce 
or repress cellular genes, resulting in stimulation of the cell cycle. About 23 early proteins 

1 5 function to usurp the cell and initiate viral DNA replication. Viral DNA replicates at about 7 
h post-infection (p.i.)» then late genes are expressed from the <4 major late" transcription unit 
Major late mRNAs are synthesized from the common "major late promoter" by alternative 
pre-mRNA processing. Each late mRNA contains a common "tripartite leader" at its 5'- 
tenninus (exons 1, 2, and 3 in Fig. 1), which allows for efficient translation of Ad late 

20 mRNAs. Cellular protein synthesis is shut off, and the cell becomes a factory for making 
viral proteins. Virions assemble in the nucleus at about 1 day p.i., and after 2-3 days the cell 
lyses and releases progeny virus. Cell lysis is mediated by the E3 1 1 .6K protein, which has 
been renamed "adenovirus death protein" (ADP) (Tollefeon et al., J. Virol 70:2296-2306, 
1996; Tollefson et al., ViroL 220:152-162, 1996). The term ADP as used herein in a generic 

25 sense refers collectively to ADFs from adenoviruses such as, e.g. Ad type 1 (Adl), Ad type 2 
(Ad2), Ad type 5 (Ad5) or Ad type 6 (Ad6) all of which express homologous ADFs with a 
high degree of sequence similarity. 

Human adenovirus type 5 (Ad5) is particularly useful for cancer gene therapy. It 
primarily causes asymptomatic or mild respiratory infections in young children, followed by 

30 long term effective immunity. Fatalities are extremely rare except when the patient is 

immunocompromised (Horwitz, M. S., Adenoviruses, p. 2149-2171 In B. N. Fields, D. M. 
Knipe, and P. M. Howley (eds.), Fields Virology, Lippincott-Raven Publishers, Philadelphia, 
PA, 1996). Ad5 is very well understood, can be grown in culture to high titer stocks that are 
stable, and can replicate in most human cancer cell types (Shenk, T., Adenoviridae: the 

35 viruses and their replication, p. 21 1 1-2148. In B. N. Fields, D. M. Knipe, and P. M. Howley 



WO 01/04282 



PCT/US00/18971 



3 

(eds.), Fields Virology, Lippincott-Raven, Philadelphia, 1996). Its genome can be 
manipulated by site-directed mutagenesis and insertion of foreign sequences. 

The Ad vectors being investigated for use in anti-cancer and gene therapy are based 
on recombinant Ad's that are either replication-defective or replication-competent Typical 
5 replication-defective Ad vectors lack the El A and E1B genes (collectively known as El) and 
contain in their place an expression cassette consisting of a promoter and pre-mRNA 
processing signals which drive expression of a foreign gene. The El A proteins induce 
transcription of other Ad genes, and in nontransformed cells they deregulate the cell cycle, 
induce or repress a variety of cellular genes, and force cells from G 0 into S-phase 48 (White, 

10 E., Semin. Virol. &505-5 13, 1998; Wold et al., pp. 200-232 In AJ. Cann (ed), DNA Virus 
Replication: Frontiers in Molecular Biology, Oxford University Press, Oxford). The E1B 
proteins inhibit cellular apoptosis. Id. These vectors are unable to replicate because they lack 
the El A genes required to induce Ad gene expression and DNA replication. In addition, the 
E3 genes are usually deleted because they are not essential for virus replication in cultured 

15 cells. 

A number of investigators have constructed replication-defective Ad vectors 
expressing anti-cancer therapeutic proteins. Usually, these vectors have been tested by direct 
injection of human tumors growing in mouse models. Most commonly, these vectors express 
the thymidine kinase gene from herpes simplex virus, and the mice are treated with 

20 gancyclovir to kill cells transduced by the vector (see e.g., Felzmann et al., Gene Ther. 

4:1322-1329, 1997). Another suicide gene therapy approach involves injecting tumors with a 
replication defective Ad vector expressing cytosine deaminase, followed by administration of 
5-fluorocytosine (Topf et al., Gene Ther. 5:507-513, 1998). Investigators have also prepared 
and tested replication-defective Ad vectors expressing a cytokine-such as IL-2, IL-12, IL-6, 

25 tumor necrosis factor (TNF), type I interferons, or the co-stimulatory molecule B7-1 in the 
anticipation that the Ad-expressed cytokine will stimulate an immune response, including 
cytotoxic T-lymphocytes (CTL), against the tumor (Felzmann et al., supra; Putzer et al., Proc. 
Natl Acad. Sci. USA 94: 10889-10894, 1997). Other vectors express tumor antigens (e.g. 
melanoma MARTI), proteins that de-regulate the cell cycle and induce apoptosis (p53, pRB, 

30 p21 Kipl/WAF \ pio^ 01 ^ 2 , and even Ad El A), and ribozymes. An Ad vector expressing FasL 
induces apoptosis and tumor regression of a mouse tumor (Arai et al., Proc Natl Acad. Set 
USA 9*13862-13867, 1997). 

Despite these generally positive reports, it is recognized in the art that 
replication-defective Ad vectors have several characteristics that make them suboptnnal for 

35 use in therapy. For example, production of replication-defective vectors requires that they be 
grown on a complementing cell line that provides the El A proteins in trans. Such cell lines 
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are fastidious, and generation of virus stocks is time-consuming and expensive. In addition, 
although many foreign proteins have been expressed from such vectors, the level of 
expression is low compared to Ad late proteins. 

To address these problems, several groups have proposed using replication- 
5 competent Ad vectors for therapeutic use. Replication-competent vectors retain Ad genes 
essential for replication and thus do not require complementing cell lines to replicate. 
Replication-competent Ad vectors lyse cells as a natural part of the life cycle of the vector. 
Another advantage of replication-competent Ad vectors occurs when the vector is engineered 
to encode and express a foreign protein. Such vectors would be expected to greatly amplify 

1 0 synthesis of the encoded protein in vivo as the vector replicates. However, in order to prevent 
RC vectors from damaging normal tissues and causing disseminated viremia, it is important 
that they have some feature that limits their replication to cancer cells. 

Wyeth Laboratories developed replication-competent Ad vectors for vaccination 
purposes, using vaccine strains of Ad serotypes 4, 7, and 5 (Lubeck et al., AIDS Res. Hum. 

15 Retroviruses 70:1443-1449, 1994). Foreign genes were inserted into the E3 region (with the 
E3 genes deleted) or into a site at the right end of the genome. Two foreign genes used were 
hepatitis B surface antigen and the HIV envelope protein. They obtained good expression in 
culture, and were able to raise antisera in animal models. Phase I human trials were 
ambiguous, and the project was mostly abandoned. 

20 Onyx Pharmaceuticals recently reported on adenovirus-based anti-cancer vectors 

which are replication deficient in non-neoplastic cells but which exhibit a replication 
phenotype in neoplastic cells lacking functional p53 and/or retinoblastoma (pRB) tumor 
suppressor proteins (U.S. Patent No. 5,677,178; Heise et al., Nature Med 5:639-645, 1997; 
Bischoff et al., Science 274:373-376, 1996). This phenotype is reportedly accomplished by 

25 using recombinant adenoviruses containing a mutation in the E1B region mat make the 

encoded E1B-55K protein incapable of binding to p53 and/or a mutation(s) in the El A region 
which make the encoded El A protein (p289R or p243R) incapable of binding to pRB and/or 
the cellular 300 kD polypeptide and/or the 1 07 kD polypeptide. E1B-55K has at least two 
independent functions: it binds and inactivates the tumor suppressor protein p53, and it is 

30 required for efficient transport of Ad mRNA from the nucleus. Because these E1B and E1A 
viral proteins are involved in forcing cells into S-phase, which is required for replication of 
adenovirus DNA, and because the p53 and pRB proteins block cell cycle progression, the 
recombinant adenovirus vectors described by Onyx should replicate in cells defective in p53 
and/or pRB, which is the case for many cancer cells, but not in cells with wild-type p53 

35 and/or pRB. Onyx has reported that replication of an adenovirus lacking E1B-55K, which is 
named ONYX-015, was restricted to p5 3 -minus cancer cell lines (Bischoff et al M supra), and 
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that ONYX-015 slowed the growth or caused regression of a p5 3 -minus human tumor 
growing in nude mice (Heise et al., supra). Others have challenged the Onyx report claiming 
that replication of ONYX-015 is independent of p53 genotype and occurs efficiently in some 
primary cultured human cells (Harada and Berk, J. Virol 75:5333-5344, 1999). It is now 
5 known that ONYX-0 1 5 can replicate in cells with wild-type p53 (Goodrum et al., /. Virol. 
72:9479-9490, 1998; Harada et al., /. Virol 75:5333-5344, 1999; Hay et al., Hum. Gene Ther. 
70:579-590, 1999; Rothmaim et al., J. Virol. 72:9470-9478, 1998; Turnell et al., J. Virol 
75:2074-2083, 1999). ONYX-015 does not replicate as well as wild-type adenovirus because 
E1B-55K is not available to facilitate viral mRNA transport from the nucleus. Also, ONYX- 

10 015 expresses less ADP than wild-type virus (see Example 1 below). 

As an extension of the ONYX-015 concept, a replication-competent adenovirus 
vector was designed that has the gene for E1B-55K replaced with the herpes simplex vims 
thymidine kinase gene (Wilder et al., Gene Therapy tf:57-62, 1999). The group that 
constructed this vector reported that the combination of the vector plus gancyclovir showed a 

15 therapeutic effect on a human colon cancer in a nude mouse model (Wilder et al., Cancer Res. 
59:410-413, 1999). However, this vector lacks the gene for ADP, and accordingly, the vector 
will lyse cells and spread from cell-to-cell less efficiently than an equivalent vector that 
expresses ADP. The gene for ADP is also lacking in another replication-competent 
adenovirus vector that has been described, in which a minimal enhancer/promoter of the 

20 human prostate specific antigen was inserted into the adenovirus El A enhancer/promoter 
(Rodriguez et ah, Cancer Res. 57:2559-2563, 1997). 

Another strategy, for replication-competent vector improvement is to place replication 
under the control of tissue-specific promoters. One group replaced the basal El A promoter 
with a modified promoter for a- fetoprotein (AFP) (Hallenbeck et al., Hum. Gene Ther. 

25 70:1721-1733, 1999). AFP is expressed in the liver during development, but it is not 
expressed in adults. However, it is expressed in 70-80% of patients with hepatocellular 
carcinoma. Growth of this vector was limited to AFP-expressing cells and the vector showed 
some suppression of xenotransplants. Id. A series ofRC vectors has also been developed 
that have expression of the El A and E1B genes dependent on the prostate tumor-specific 

30 prostate specific antigen (PSA) and kallikrein promoters/enhancers (Rodriguez et al., Cancer 
Res. 60:\ 196, 1997; Yu et al., Cancer /ter.59:4200-4203, 2000; Yu et al., Cancer Res 
59:1498-1504,1999). 

Thus, there is a continuing need for vectors that replicate and spread efficiently in 
tumors but that can be modified such that they replicate poorly or not at all in normal tissue. 

35 Summary of the Invention 
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Briefly, therefore, the present invention is directed to novel vectors which are 
replication competent in neoplastic cells and which overexpress an adenovirus death protein 
(ADP). The work reported herein demonstrates the discovery that overexpression of ADP by 
a recombinant adenovirus allows the construction of a replication-competent adenovirus that 
5 kills neoplastic cells and spreads from cell-to-cell at a rate similar to or faster than that 
exhibited by adenoviruses expressing wild-type levels of ADP, even when the recombinant 
adenovirus contains a mutation that would otherwise reduce its replication rate in non- 
neoplastic cells. This discovery was unexpected because it could not have been predicted 
from what was known about adenovirus biology that Ad vectors overexpressing ADP remain 

10 viable and that the infected cells are not killed by the higher amounts of ADP before the Ad 
vector produces new virus particles that can spread to other tumor cells. Indeed, naturally- 
occurring adenoviruses express ADP in low amounts from the E3 promoter at early stages of 
infection, and begin to make ADP in large amounts only at 24-30 h p.i., once virions have 
been assembled in the cell nucleus. It is believed that other non-adenoviral vectors can be 

1 5 used to deliver ADP r s cell-killing activity to neoplastic cells, including other viral vectors and 
plasmid expression vectors. 

Thus, in one preferred embodiment, the ADP-expressing vector comprises a 
recombinant adenovirus lacking expression of at least one £3 protein selected from the group 
consisting of: gpl9K; RIDa (also known as 10.4K); RIDP (also known as 14.5K) and 14.7K. 

20 Because these E3 proteins inhibit immune-mediated inflammation and/or apoptosis of Ad- 
infected cells* it is believed that a recombinant adenovirus lacking one or more of these E3 
proteins will stimulate infiltration of inflammatory and immune cells into a tumor treated with 
the adenovirus and that this host immune response will aid in destruction of the tumor as well 
as tumors that have metastasized. The ADP expressed by preferred embodiments comprises a 

25 naturally^)ccuning amino acid sequence from a human adenovirus of subgroup C, namely 
Adl,Ad2,Ad5andAd6. 

In another embodiment, replication of the vector is restricted to neoplastic cells. Such 
replication-restricted vectors are useful in treating cancer patients in which it is desirable to 
eliminate or reduce damage to normal cells and tissues that might be caused by the vector, 

30 particularly viral vectors that kill the host cell as part of their life cycle. In preferred 

embodiments, a recombinant adenovirus has a replication-restricted phenotype because die 
recombinant adenovirus is incapable of expressing an El A viral protein which binds the pRB 
and the p300/CBP proteins or because the E4 promoter has been substituted with a promoter 
that is activated only in neoplastic cells and/or cells of a specific tissue. 

35 In yet another embodiment, the invention provides a vector which o ver expr esses ADP 

and whose replication is under the control of a tissue specific promoter, tumor specific 
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promoter or an inducible promoter. In preferred embodiments, the vector comprises a 
recombinant adenovirus in which the tissue specific promoter or inducible promoter is 
substituted for the E4 promoter. Such vectors are useful for restricting replication of the 
vector and its ADP-mediated cell killing to cells of a particular type or to cells exposed to an 
5 exogenous agent that activates the promoter. A preferred tissue-specific or inducible vector 
also expresses a phenotype that restricts its replication to neoplastic cells. 

In yet another embodiment, the invention provides a vector which overexpresses ADP 
but which is not restricted to tumors by a specific genetic modification. Such a vector is more 
destructive to neoplastic cells than even the naturally occurring Ad's of subgroup C. In 

10 preferred embodiments, this vector could be used for patients with terminal cancer not 

treatable by another method, and who have pre-existing neutralizing antibodies to Ad or to 
which neutralizing antibodies can be administered. 

In still another embodiment, the invention provides a composition comprising a first 
recombinant virus which is replication competent in a neoplastic cell and overexpresses the 

1 5 adenovirus death protein. In one embodiment, the recombinant vims is contained within a 
delivery vehicle comprising a targeting moiety that limits delivery of the virus to cells of a 
certain type. With this embodiment, the replication-competent vector can be of any ADP- 
overexpressing configuration described herein. In some embodiments, the composition also 
comprises a second recombinant virus which is replication-defective and which expresses an 

20 anti-cancer gene product In some embodiments, the replication-defective vector may be 
engineered to overexpress ADP when replication of this vector is complemented by a 
replication-competent vector. The recombinant virus complements spread of the replication- 
defective virus, as well as its encoded anti-cancer product, throughout a tumor. In preferred 
embodiments, the first recombinant virus is a recombinant adenovirus whose replication is 

25 restricted to neoplastic cells and/or which lacks expression of one or more of the E3 gpl9K; 
RJDa; RIDP; and 14.7K proteins. 

In additional embodiments, the invention provides replication-competent vectors that 
overexpresses an ADP and also expresses an anti-cancer product As with previous 
embodiments, the vector can be of any ADP-overexpressing configuration provided herein. 

30 Preferably, replication of the virus is engineered to (a) be restricted to neoplastic cells, e.g., by 
replacing the E4 promoter with a tissue specific or tumor specific promoter and/or (b) lack 
expression of one or more of tie E3 gpl9K; RTDa; RTDp; and 14.7K proteins. In some 
embodiments, the anti-cancer product is inserted into the E3 region. 

The ADP-expressing vectors and compositions of the invention are useful in a 

35 method for promoting death of a neoplastic cell. The method comprises contacting the 

neoplastic cell with a vector which is repbcation-competent in the neoplastic cell and which 
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overcxpresses ADP. Where the neoplastic cell comprises a tumor in a patient, the vector is 
administered directly to the tumor or, in other embodiments, the vector is administered to the 
patient systemically or in a delivery vehicle containing a targeting moiety that directs delivery 
of the vector to the tumor. In embodiments where the vector is a recombinant vims, the 
5 method can also comprise passively immunizing the patient against the vims. 

In yet another embodiment of the invention, the vector may be used in combination 
with radiation therapy. The radiation therapy can be any form of radiation therapy used in the 
art such as for example, external beam radiation such as x-ray treatment, radiation delivered 
by insertion of radioactive materials within the body near or at the tumor site such as 
1 0 treatment with gamma ray emitting radionuclides, particle beam therapy which utilizes 

neutrons or charged particles and the like. In addition, this embodiment encompasses the use 
of more than one of the vectors of the present invention in a cocktail in combination with 
radiation therapy. 

Another embodiment of the invention involves the use of the recombinant vector in 

15 combination with chemotherapy as has been disclosed for other adenovirus vectors (U.S. 
Patent No. 5,846,945). Chemotheraputic agents are known in the art and include 
antimetabolites including pyrimidine-analogue and purine-analogue antimetabolites, plant 
alkaloids, antitumor antibiotics, alkylating agents and the like. The use of more than one of 
the vectors of the present invention with a chemotheraputic agent or agents is also 

20 contemplated within this embodiment 

Among the several advantages found to be achieved by the present invention, 
therefore, may be noted the provision of replication-competent vectors, particularly viruses, 
which rapidly kill cancer cells and spread from cell-to-cell in a tumor; the provision of such 
vectors whose replication can be induced or which is restricted to tumors and/or to cells of a 

25 certain tissue type; and the provision of compositions and methods for anti-cancer therapy 
which cause little to no side effects in normal tissues. 
Brief Description of the Drawings 

Figure 1 is a schematic of gene expression in Ad5 (Fig. 1A) and KD3, a preferred 
embodiment of the invention (Fig. IB), in which the respective genomes are represented by 

30 the stippled bars and transcription units represented by arrows above and below the bars, with 
the E3 proteins listed above the arrows for the E3 transcription unit, and the LI to L5 families 
of late mRNA's indicated. 

Figure 2 illustrates the overexpression of ADP by KD1, KD3 , GZ1, and GZ3 
showing an immunbblot of proteins isolated from human A549 cells infected with the 

35 indicated viruses and probed with an anti-ADP antibody, with ADP indicating differently 
glycosylated and proteolyticaily processed forms of ADP. 
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Figure 3 illustrates that the El A dl\ 101/1 107 mutation referred to in the figure and 
hereinafter as <f/01/07, retards expression of late proteins, showing an immunoblot of E1A 
proteins and late proteins in A549 cells infected with the indicated viruses in the absence 
(Figs. 3A and 3B) or presence (Figs. 3C and 3D) of d/327, which has a wild-type El A region 
5 and has a deletion of all E3 genes but the gene encoding the 12.5K protein (Figs. 3C and 3D). 
An antiserum specific to the El A proteins was used for Fig. 3A and 3C. An antiserum raised 
against Ad5 virions was used for Figs. 3B and 3D. 

Figure 4 illustrates that KD1 and KD3 kill cells more efficiently than control viruses 
that express less or no ADP, showing a graph of the percent of A549 cells infected with the 
10 indicated viruses that were viable at the indicated days p.i. as determined by trypan blue 
exclusion. 

Figure 5 is a cell spread assay illustrating that overexpression of ADP enhances 
spread of virus from cell to cell, showing monolayers infected with the indicated viruses at the 
indicated PFU/cell which were treated at 7 days p.i. with crystal violet, which stains live cells 

15 but not dead cells. 

Figure 6 illustrates that KD1 and KD3 replicate well in growing cells but not in 
growth-arrested cells showing the virus titer extracted from growing or growth arrested HEL- 
229 cells at various times following infection with 100 PFU/ml of the following viruses: 
dl3Q9 (Fig. 6A), rf/01/07 (fig. 6B), KD1 (Fig. 6C) and KD3 (Fig 6D). 

20 Figure 7 illustrates that KD1 and KD3 are defective in killing primary human 

bronchial epithelial cells showing these cell monolayers infected at 30% confluency with 10 
PFU/ml of the indicated viruses and stained at 5 days p.i. with neutral red. 

Figure 8 illustrates that KD1 and KD3 reduce the growth rate of human A549 cell 
tumors growing in nude mice, showing in Fig. 8 A a graph of average-fold increase in tumor 

25 size plotted against the number of weeks following infection of the tumor with buffer or with 
5 x 10 7 PFU at weekly intervals of or the indicated viruses, and showing in Fig. 8B a similar 
graph of tumors injected once with 5 x 10 8 PFU of KD3 or GZ3. 

Figure 9 illustrates that KD1 and KD3 reduce the growth rate of human Hep3B cell 
tumors growing in nude mice, showing a graph of average-fold increase in tumor size plotted 

30 against the number of weeks following injection of the tumor with buffer or with 5 x 1 0 7 PFU 
of d/309, KD1 or KD3 at twice weekly intervals of the indicated viruses. 

Figure 10 illustrates that KD1 and KD3 complement the replication and spread of Ad- 
f)-gal, a replication-defective vector that expresses (3-galactosidase, using an infectious center 
assay showing in Fig. 10A a picture of A549 cell monolayers seeded with A549 cells infected 

35 with Ad-0-gal alone or with the indicated viruses, with Figs 10B and 10C showing close-up 
views of two of the monolayers of Fig. 10A. 
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Figure 1 1 is a bar graph illustrating that KD1 and KD3 increase the expression of 
luciferase in human Hep3B cell tumors growing in nude mice, using an assay in which tumors 
were injected with the indicated combinations of viruses, then were extracted 2 weeks p.i. and 
assayed for luciferase activity. The numbers in parentheses indicated the fold increase in 
5 luciferase activity compared to that of the Adluc vector plus buffer. 

Figure 12 is a graph showing the results of a standard plaque development assay for 
KD1 and KD1-SPB on A549 cells engineered to express the TTF1 transcription factor 
(A549/TTF1) and the parental 549 cells, in which data are plotted as the number of plaques 
observed on a particular day in the assay divided by the final number of plaques observed for 
1 0 that virus multiplied by 1 00. 

Figure 13 is a cell spread assay for KD1 and KD1-SPB on H441 cells and Hep3B 
cells, where cells were infected with the indicated amounts of KD1 or KD1-SPB and H441 
cells and Hep3B cells were strained with crystal violet at 5 days p.i. and 8 days p.i., 
respectively. 

1 5 Figure 1 4 is a graph showing the results of a standard plaque development assay for 

rf/309 and two preferred embodiments of the invention, GZ1 and GZ3, in which data are 
plotted as the number of plaques observed on a particular day in the assay divided by the final 
number of plaques observed for that virus multiplied by 100. 

Figure 15 is a cell spread assay illustrating that the combination of KD1, KD3, GZ1, 

20 or GZ3 with x-ray radiation is more effective in destroying A549 cell monolayers than is 

virus vector alone or radiation alone, wherein cells were infected with the indicated amounts 
of the indicated viruses, radiated with 600 centi greys (cGy) of x-radiation (bottom panel), or 
mock radiated (top panel), then stained with crystal violet at 6 days p.i. 

Figure 16 is a graph of a cell spread assay illustrating that 10~ 3 PFU of KD1, KD3, 

25 GZ1, or GZ3 used in combination with 150, 300, or 600 centigreys of radiation is more 

effective in destroying A549 cell monolayers than vims vector alone or radiation alone. Cell 
viability is based on the amount of crystal violet extracted from the culture wells, using the 
mock-infected non-radiated well as 100% viability. 

Figure 17 illustrates that the combination of KD3 or GZ3 plus x-ray radiation is more 

30 effective in reducing the growth of A549 cell tumors growing in nude mice than KD3 alone or 
GZ3 alone. 

Figure 18 illustrates a structure-function analysis of ADP, showing in Fig. 18A the 
amino acid sequence of the adenovirus death protein encoded by Ad2, with the various 
putative domains and glycosylation sites labeled and showing in Fig. 1 8B a schematic of the 
35 ADP gene in rerfQO and in the indicated deletion mutants, with the right column 
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summarizing the death promoting phenotype of the various mutants as a percentage of the 
wild-type phenotype. 

Figures 19A and 19B illustrate a cell viability assay of the indicated ADP mutants 
showing a graph of viability as determined by trypan blue exclusion plotted against hours 
5 (Fig. 19A) or days (Fig. 19B) postinfection. 

Figure 20 depicts the amino acid sequence, shown in single letter code, for the ADP 
proteins of Adl, Ad2, Ad5, and Ad6 (SEQ ID NOS:5-8), for the Ad2 ADP mutants i/716, 
<ff715, <tn\4, and J/737 (SEQ ID NOS:9-12), and for the putative lumenal domain (SEQ ID 
NO: 17), the transmembrane domain (SEQ ID NO: 18), the cytosolic basic-proline domain 
1 0 (SEQ ID NO: 1 9), and the remainder of the cystosolic domain (SEQ ED NO:20) of the ADP 
protein of Ad2. 

Figure 21 presents the complete nucleotide sequence of the genome of Ad5. 

Figure 22 presents the complete nucleotide sequence of the genome of KD1 (SEQ ID 

NO:l). 

15 Figure 23 presents the complete nucleotide sequence of the genome of KD3 (SEQ ID 

NO:2). 

Figure 24 is a schematic of the following vectors: A. Ad5. The stippled bar 
indicates the DNA genome of 36 kbp. The open arrow indicates the immediate early El A 
transcription unit, and the black arrows are the delayed early E1B, E2, E3, and E4 

20 transcription units. The hatched arrows indicate the five families of major late mRNAs, and 
also the ADP mRNA, which is synthesized as part of the major late transcription unit Each 
major late mRNA has a tripartite leader (leaders 1, 2, and 3) spliced to its 5' terminus. B. 
dl309. dl309 is identical to Ad5 except it has the E3-RTD and E3-14.7K genes deleted. dl309 
expresses ADP at levels similar to Ad5. C. KD1 . KD1 has two small deletions (indicated by 

25 "X" marks) in the El A gene that abolish binding of the El A proteins to pRB or p300/CBP. It 
lacks all E3 genes except adp. ADP is expressed earlier in infection and in greater abundance 
than is ADP from Ad5 or dl309 Doronin et al., J. Virol 74:6147-6155. D. KD1-SPB. KD1- 
SPB is identical to KD1, except it has the E4 promoter replaced by the promoter for 
Surfactant Protein B (SPB-P). 

30 Figure 25 presents graphs illustrating that KD1-SPB grows as well as KD1 in H441 

lung carcinoma cells but much more poorly than KD1 in Hep 3B hepatoma cells. CsCl- 
banded stocks of KD1-SPB and KD1 were titered using standard methods (Tollefson et al., p. 
1-9 In W.S.M. Wold (ed.), Adenovirus Methods and Protocols. Humana Press, Inc., Totowa, 
NJ, 1998) on 293-E4 or 293 cells (A), or on A549 cells (B). The data are plotted as the 

35 number of plaques seen on any day of the plaque assay as a percentage of the number of 
plaques seen on the final day of the assay (Tollefson et al., Virology 220:152-162, 1996). 
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Figure 26 presents micrographs illustrating that KD1-SPB induces CPE in H441 cells 
but not Hep 3B cells. H441 and Hep 3B monolayers were mock-infected or infected with 10 
PFU/cell of KD1 or KD 1-SPB, then photographed under phase contrast at 4 or 7 days p.i. 

Figure 27 depicts Southern hybridizations and a graph illustrating that KD 1-SPB 
5 DNA is synthesized efficiently in H441 but not Hep 3B cells. H441 or Hep 3B cells were 
infected with 10 PFU/cell of KD1 or KD1-SPB. Total genomic DNA was isolated at 0, 5, 24, 
48, 72, and 96 h p.i., digested with Hindm, resolved by agarose gel electrophoresis, blotted, 
and hybridized with 32 P-labeled Ad DNA. A. Autoradiogram. B. Phosphorimager 
quantitation of the DNA bands in Panel A. 
10 Figure 28 presents graphs depicting single step growth curves showing that KD 1-SPB 

grows well in H441 but not Hep 3B cells. Cells were infected with 10 PFU/cell of KD1 or 
KD1-SPB. Vectors were extracted at the indicated days p.i. and titers determined by plaque 
assay. 

Figure 29 depicts immunoblots showing that KD 1-SPB expresses E40RF3 and ADP 

1 5 in H441 but not Hep 3B cells. Cells were infected with 10 PFU/cell of KD1 or KD1-SPB. At 
24 h p.i., protein extracts were analyzed for El A, E40RF3, and ADP using specific antisera. 
The El A proteins appear as multiple bands. ADP appears as two bands; the upper band is 
glycosylated and the lower band is a proteolytically cleaved species (Scaria et al., Virology 
7P/:743-753, 1992; Tollefson et al., /. Virol 5tf:3633-3642). 

20 Figure 30 depicts immunofluorescence micrographs showing that KD 1-SPB 

expresses E40RF3 in H441 but not Hep 3B cells. Cells growing on coverslips were infected 
with 20 PFU/cell of KD1, KD1-SPB, or dl309 (wild-type). At 48 h (Panel A) or 6 days 
(Panel B), cells were fixed and stained with a rabbit polyclonal antipeptide antiserum against 
E40RF3. Photographs were taken using a 100X Planapo lens. Each panel shows about 8 

25 nuclei. This figure is part of the same experiment shown in Figure 3 1 . 

Figure 3 1 depicts immunofluorescence micrographs showing that KD 1-SPB does not 
express E2-DBP or fiber efficiently in Hep 3B cells. Hep 3B cells were infected with 20 
PFU/cell of KD1-SPB or KD1. At 48 h (A) or 6 days (B) p.i., cells were fixed and double- 
stained using a rabbit polyclonal antiserum against DBP and a mouse monoclonal antibody 

30 against fiber. The same fields are shown for DBP and fiber. This figure is part of the same 
experiment shown in Figure 30. 

Figure 32 presents graphs illustrating that KD1-SPB lyses H441 but not Hep 3B as 
efficiently as KD1 . H441 or Hep 3B cells were mock-infected or infected with 20 PFU/cell 
of KD1 or KD 1-SPB. Cell lysis was determined by release of lactate dehydrogenase from the 

35 cells into the medium. 
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Figure 33 presents graphs illustrating that KD1-SPB suppresses growth of H441 
tumors in nude mice equally as well as KD1. Tumor cells were injected into flanks of nude 
mice and allowed to grow to about 100 uJ (H441) or 150 ul (Hep 3B) volumes. Tumors (n = 
10) were injected with DMEM (mock) or with 5 x 10 7 PFU of KD1 or KD1-SPB. Injections 
5 of the viruses were repeated twice weekly for 3 weeks to a total dose of 3.0 x 10 8 PFU per 
tumor. Tumors were measured and the mean fold-increase in tumor size was calculated. 
Description of the Preferred Embodiments 

In accordance with the present invention, it has been discovered that overexpression 
of ADP by a recombinant adenovirus results in raster lysis of cells and spread of the virus 
1 0 throughout a cell monolayer than viruses expressing wild-type levels of ADP. It has also 
been discovered that this function for ADP is manifest in an adenovirus that contains El A 
mutations that restrict adenoviral replication to neoplastic cells. Thus, vectors which are both 
replication competent in neoplastic cells and which overexpress ADP should be useful in anti- 
cancer therapy. 

15 In the context of this disclosure, the following terms will be defined as follows unless 

otherwise indicated: 

"Naturally-occurring" as applied to an object such as a polynucleotide, polypeptide, 
or virus means that the object can be isolated from a source in nature and has not been 
intentionally modified by a human. 

20 "Neoplastic cell" means a cell which exhibits an aberrant growth phenotype 

characterized by a significant loss of control of cell proliferation and includes actively 
replicating cells as well as cells in a temporary non-replicative resting state (G, or Gj). A 
neoplastic cell may have a well-differentiated phenotype or a poorly-differentiated phenotype 
and may comprise a benign neoplasm or a malignant neoplasm. 

25 "Recombinant virus" means any viral genome or virion that is different than a wild- 

type virus due to a deletion, insertion, or substitution of one or more nucleotides in the wild- 
type viral genome. The recombinant virus can have changes in the number of amino acid 
sequences encoded and expressed or in the amount or activity of proteins expressed by the 
virus. In particular, the term includes recombinant viruses generated by the intervention of a 

30 human. 

"Replication-competent" as applied to a vector means that the vector is capable of 
replicating in normal and/or neoplastic cells. As applied to a recombinant virus, "replication- 
competent" means that the virus exhibits the following phenotypic characteristics in normal 
and/or neoplastic cells: cell infection; replication of the viral genome; and production and 
35 release of new virus particles; although one or more of these characteristics need not occur at 
the same rate as they occur in the same cell type infected by a wild-type vims, and may occur 
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at a faster or slower rate. Where the recombinant virus is derived from a virus such as 
adenovirus that lyses the cell as part of its life cycle, it is preferred that at least 5 to 25% of Ac 
cells in a cell culture monolayer are dead 5 days after infection. Preferably, a replication- 
competent virus infects and lyses at least 25 to 50%, more preferably at least 75%, and most 
5 preferably at least 90% of the cells of the monolayer by 5 days post infection (p.i.). 

"Replication-defective" as applied to a recombinant virus means the virus is incapable 
of, or is greatly compromised in, replicating its genome in any cell type in the absence of a 
complementing replication-competent virus. Exceptions to this are cell lines such as 293 cells 
that have been engineered to express adenovirus E1A and E1B proteins. 

1 0 "Replication-restricted" as applied to a vector of the invention means the vector 

replicates better in a dividing cell, i.e. either a neoplastic cell or a non-neoplastic, dividing 
cell, than in a cell of the same type that is not neoplastic and/or not dividing, which is also 
referenced herein as a normal, non-dividing cell. Preferably, a replication-restricted virus 
kills at least 10% more neoplastic cells than normal, non-dividing cells in cell culture 

1 5 monolayers of the same size, as measured by the number of cells showing cytopathic effects 
(CPE) at 5 days p.i. More preferably, between 25% and 50%, and even more preferably, 
between 50% and 75% more neoplastic than normal cells are killed by a replication-restricted 
virus. Most preferably, a replication-restricted adenovirus kills between 75% and 100% more 
neoplastic than normal cells in equal sized monolayers by 5 days p.i. 

20 In one embodiment the invention provides a vector that is replication-competent in 

neoplastic cells and which overexpresses an ADP. Vectors useful in the invention include but 
are not limited to plasmid-expression vectors, bacterial vectors such as Salmonella species 
that are able to invade and survive in a number of different cell types, vectors derived from ' 
DNA viruses such as human and non-human adenoviruses, adenovirus associated viruses 

25 (AAVs), poxviruses, herpesviruses, and vectors derived from RNA viruses such as 

retroviruses and alphaviruses. Preferred vectors include recombinant viruses engineered to 
overexpress an ADP. Recombinant adenoviruses are particularly preferred for use as the 
vector, especially vectors derived from Adl, Ad2, Ad5 or Ad6. 

Vectors according to the invention overexpress ADP. As applied to recombinant Ad 

30 and AAV vectors, the term "overexpresses ADP" means that more ADP molecules are made 
per viral genome present in a dividing cell infected by the vector than expressed by any 
previously known recombinant adenoviral vector or AAV in a dividing cell of the same type. 
As applied to other, non-adenoviral vectors, "overexpresses ADP" means that the virus 
expresses sufficient ADP to lyse a cell containing the vector. 

35 Vectors overexpressing ADP can be prepared using routine methodology. See, eg., 

A Laboratory Cloning Manual, 2nd Ed, vol. 3, Sambrook et al., eds., Cold Spring Harbor 
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Laboratory Press, 1989. For example, a polynucleotide encoding the ADP can be cloned into 
a plasmid expression vector known to efficiently express heterologous proteins in mammalian 
cells. The polynucleotide should also include appropriate termination and polyadenylation 
signals. Enhancer elements may also be added to the plasmid to increase the amount of ADP 
5 expression. Viral vectors overexpressing ADP can be prepared using similar materials and 
techniques. 

Where the virus is a recombinant adenovirus, overexpression of ADP can be achieved 
in a multitude of ways. In general, any type of deletion in the E3 region that removes a splice . 
site for any of the E3 mRNAs will lead to overexpression of the mRNA for ADP, inasmuch 

10 as more of the E3 pre-mRNA molecules will be processed into the mRNA for ADP. This is 
exemplified in the KD1, KD3, GZ1 and GZ3 vectors (SEQ ID NOS:l-4) whose construction 
is described below. Other means of achieving overexpression of ADP in Ad vectors include, 
but are not limited to: insertion of pre-mRNA splicing and cleavage/polyadenylation signals 
at sites flanking the gene for ADP; expression of ADP from another promoter, e.g. the human 

15 cytomegalovirus promoter, inserted into a variety of sites in the Ad genome; and insertion of 
the gene for ADP behind the gene for another Ad mRNA, together with a sequence on the 5* 
side of the ADP sequence that allows for internal initiation of translation of ADP, e.g. the Ad 
tripartite leader or a viral internal ribosome initiation sequence. 

The ADP expressed by a vector according to the invention is any polypeptide 

20 comprising a naturally-occurring full-length ADP amino acid sequence or variant thereof that 
confers upon a vector expressing the ADP the ability to lyse a cell containing the vector such 
that replicated copies of the vector are released from the infected cell. A preferred full-length 
ADP comprises the ADP amino acid sequence encoded by Adl, Ad2, Ad5 or Ad6. These 
naturally-occurring ADP sequences are set forth in SEQ ID NOS:5-8, respectively. ADP 

25 variants include fragments and deletion mutants of naturally-occurring adenovirus death 
proteins, as well as full-length molecules, fragments and deletion mutants containing 
conservative amino acid substitutions, provided that such variants retain the ability, when 
expressed by a vector inside a cell, to lyse the cell. 

Conservative amino acid substitutions refer to the interchangeability of residues 

30 having similar side chains. Conservatively substituted amino acids can be grouped according 
to the chemical properties of their side chains. For example, one grouping of amino acids 
includes those amino acids having neutral and hydrophobic side chains (A, V, L, I, P, W, F, 
and M); another grouping is those amino acids having neutral and polar side chains (G, S, T, 
Y, C, N, and Q); another grouping is those amino acids having basic side chains (K, R, and 

35 H); another grouping is those amino acids having acidic side chains (D and E); another 
grouping is those amino acids having aliphatic side chains (G, A, V, L, and I); another 
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grouping is those amino acids having aliphatic-hydroxyl side chains (S and T); another 
grouping is those amino acids having amine-containing side chains (N, Q, K, R, and H); 
another grouping is those amino acids having aromatic side chains (F, Y, and W); and another 
grouping is those amino acids having sulfur-containing side chains (C and M). Preferred 
5 conservative amino acid substitutions groups are: R-K; E-D, Y-F, L-M; V-I, and Q-H. 

As used herein, an ADP variant can also include modifications of a naturally- 
occurring ADP in which one or more amino acids have been inserted, deleted or replaced with 
a different amino acid or a modified or unusual amino acid, as well as modifications such as 
glycosylation or phosphorylation of one or more amino acids so long as the ADP variant 

1 0 containing the modified sequence retains cell lysing activity. 

As described below, the inventors herein performed a structure-function analysis of 
ADP that defined specific domains in ADP required to promote cell death. Using this 
information, when combined with known recombinant DNA and cloning methodology, it is 
believed the skilled artisan can readily construct ADP variants of a naturally-occurring 

15 adenovirus death protein and test them for cell lysing activity. A preferred ADP deletion 
mutant comprises an ADP amino acid sequence from any of the deletion mutants d/716, 
d/715, dll\4 and whose ADP sequences are set forth in SEQ ID NOS:9-12, 
respectively). 

Where the vector is derived from a virus, it is preferred that the virus lack expression 

20 of one or more viral proteins involved in avoiding host anti-viral defenses such as immune- 
mediated inflammation and/or apoptosis of infected cells. For example, adenovirus contains a 
cassette of genes that prevents killing of Ad-infected cells by the immune system (Wold et al., 
Semin. Virol, 1998(8:515-523, 1998). The E3-14.7K protein and the E3 RID (Receptor 
Internalization and Degradation) protein, which is a complex consisting of RIDa and RIDP, 

25 inhibit apoptosis of Ad-infected cells induced by tumor necrosis factor (TNF) and the Fas 
ligand which are expressed on, or secreted by, activated macrophages, natural killer (NK) 
cells, and cytotoxic lymphocytes (CTLs) (Tollefcon et al., Nature 392:127-730, 1998). The 
E3-gpl9K protein inhibits CTL-killmg of infected cells by blocking transport of MHC class I 
antigens to the cell surface (Wold et aL, supra). Thus, it is believed that infection of tumor 

30 cells by such viral vectors will stimulate infiltration of inflammatory cells and lymphocytes 
into the tumor, and will not prevent infected tumor cells from apoptosis induced by cytolytic 
cells of the immune system, or against apoptosis inducing cytokines. For example, it is 
known that when mice are infected with Ad mutants lacking the E3 gpl9K, RID and 14.7K 
proteins there is a dramatic increase (as compared to E3 -positive Ad) in infiltration of 

35 inflammatory cells and lymphocytes into the infected tissue (Sparer et al., J. Virol. 70-243 1- 
2439, 1996). A similar infiltration of tumors infected by an ADP-expressing viral vector of 
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the invention would be expected to further promote destruction of the tumor by adding an 
immune system attack to the ADP-mediated killing activity. For example, it is believed that 
the viral infection will stimulate formation of tumor-specific CTL's that can kill neoplastic 
cells not only in the tumor but also ones that have metastasized. In addition, it is also 
5 expected that vector-specific CTL's will be generated which could attack vector-infected cells 
if the vector spreads away from the tumor into normal cells. Because viral vectors 
overexpressing ADP will spread rapidly through the tumor, it is believed these immune 
mechanisms will have little effect on spread of the vector. 

Where the vector is a recombinant adenovirus, it is preferred that the adenovirus lack 
10 expression of each of the E3 gpl9K, RID, and 14.7K proteins. By "lack expression" and 
"lacking expression" of a protein(s), it is meant that the viral genome contains one or more 
mutations that inactivates expression of a functional protein, i.e., one having all the functions 
of the wild-type protein. The inactivating mutation includes but is not limited to substitution 
or deletion of one or more nucleotides in the encoding gene(s) that prevents expression of 
1 5 functional transcripts or that results in transcripts encoding nonfunctional translation products. 
A particularly preferred way to inactivate expression of the Ad E3 gpl9K, RID, and 14.7K 
proteins is by deleting the E3 region containing the genes encoding these proteins. 
Preferably, one or both of the E3 genes encoding the E3 6.7K and 12.5K proteins are also 
deleted because, as discussed in the Examples below, it is believed that deletion of most or all 
20 of the E3 genes other than the ADP gene facilitates overexpression of ADP mRNA by 
reducing competition for splicing of the major late pre-mRNAs. Preferred Ad vectors 
containing an E3 deletion that overexpress ADP are GZ1 (SEQ ID NO:3) and GZ3 (SEQ ID 
NO:4), whose construction and properties are described in the Examples below. 

The invention also provides ADP-expressing vectors whose replication is restricted to 
25 dividing cells. Any means known to provide such a replication-restricted phenotype may be 
used. For example, WO 96/40238 describes microbes that preferentially invade tumor cells 
as well as methods for identifying and isolating bacterial promoters that are selectively 
activated in tumors. It is also contemplated that expression of one or more vector proteins 
essential for replication can be placed under the control of the promoter for a cellular gene 
30 whose expression is known to be upregulated in neoplastic cells. Examples of such genes 
include but are not limited to: the breast cancer markers mammaglobin (Watson et ah, 
Oncogene 70:817-824, 1998); BRCA1 (Norris et aL f / Biol Chenu 270:227T7-2m2 9 1995) 
her2/neu (Scott et al., /. BioL Chem. 269: 19848- 19858, 1994); prostate specific antigen (U.S. 
Patent 5,698,443); surfectant protein B for lung alveoli (Yan et al., y. BioL Chem. 270:24852- 
35 24857, 1995); factor VII for liver (Greenberg et al., Proa Natl Acad ScL USA 02:12347- 
12351, 1995); and survivin for cancer in general (Li et al., Nature JPtf:580-584). Where the 
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vector is an adenovirus, it is contemplated that such tumor-specific promoters can be 
substituted for the E4 promoter. Because E4 gene products are essential for Ad replication, 
placing their expression under the control of a tumor-specific promoter should restrict 
replication of the vector to tumor cells in which the promoter is activated. 
5 Another strategy for restricting replication of ADP-expressing Ad vectors to 

neoplastic cells is exemplified by the KD1 (SEQ ID NO:l), KD2 (SEQ ID NO:13) and KD3 
(SEQ ID NO:2) vectors, whose construction and properties are described in the Examples 
below. This strategy exploits a pre-existing Ad5 mutant in the El A gene, named dll 101/1 107 
(Howe et al., Proa Natl Acad. ScL, 57:5883-5887, 1990), also referred to herein as dfOl/07, 

10 and which can only grow well in cancer cells. The role of El A is to drive cells from the Go 
and G| phases of the cell cycle into S-phase. This is achieved by two mechanisms, one 
involving pRB (and family members), and the other involving p300 and the related protein 
CBP (DePinho, RJL, Nature 597:533-536, 1998). One domain in El A binds members of the 
pRB family. pRB normally exists in the cell as a complex with the transcription factor E2F-1 

15 and E2F family members (E2F), tethered via E2F to E2F binding sites in promoters of cells 
expressed in S-phase. Here, pRB acts as a transcriptional co-repressor. El A binding to pRB 
relieves this repression, and causes the release of E2F from pRB/E2F complexes. Free E2F 
then activates promoters of genes expressed in S-phase, e.g. thymidine kinase, ribonucleotide 
reductase, etc. Another domain in El A binds the p300/CBP transcription adaptor protein 

20 complex. p300/CBP is a transcriptional co-activator that binds many different transcription 
factors and accordingly is targeted to promoters. p300/CBP has intrinsic histone 
acetyltransferase activity. El A binding to p300/CBP is believed to inhibit this histone 
acetyltransferase activity, allowing acetylation of histories and repression of transcription 
(Chakravarti et al., Cell P<f:393-403, 1999; Hamamori et al., Cell 9*405-413, 1999). 

25 Conceivably, some of the genes that are repressed as a result of El A interacting with 

p300/CBP to play a role in blocking the cell cycle, although this is not known. Cancer cells 
are cycling, so they have free E2F and presumably some p300/CBP-regulated genes are 
repressed. Consistent with these ideas, El A must bind both p300/CBP and the pRB family in 
order to transform primary cells to a constitutively cycling state (Howe et al., supra). The 

30 mutant dIO 1/07 lacks both the p300/CBP- and pRB-binding domains and, as expected, it 

replicates very poorly in non-dividing "normal" cells or serum-starved cancer cells, but well 
in growing cancer cells. As described below, the growth of the KD1 and KD3 vectors, which 
contain the J/01/07 El A mutation, is very much better in dividing cancer cells as compared to 
non-dividing cells* Because the <ff01/07 mutant is completely defective in oncogenic 

35 transformation of rat cells (Howe et la., supra), vectors according to the invention that contain 
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this El A mutation cannot induce cancer in humans (remote as that may be) through an El A- 
dependent mechanism. 

The invention also includes vectors overexpressing ADP whose replication is 
restricted to specific tissues by placing expression of one or more proteins essential for 
5 replication undo- the control of a tissue specific promoter and/or a tumor specific promoter. 
A number of tissue-specific and/or tumor specific promoters have been described in the art 
Non-limiting examples include the surfactant protein B promoter, which is only active in cells 
containing the TTF1 transcription factor (i.e., type n alveolar cells (Yan et al., supra)), as 
described in U.S. Patent 5,466,596 to Breitman et al., which directs gene expression 

1 0 specifically in cells of endothelial lineage; prostate specific antigen which is expressed in 
prostate cells (Rodriguez et al., supra); human telomerase protein (hTERT) promoter (see, 
e.g., U.S. Patent No. 6,054,575); and human alpha-lactalbumin gene which is expressed in 
breast cancer cells (Anderson et al., Gene Therapy £854-864, 1999). Many other tissue- 
specific, tumor specific, or tissue-preferred enhancer/promoters have been reported (Miller 

15 and Whelan, Human Gene Therapy 5:803-815, 1997). As exemplified with the surfactant 

protein B promoter in Examples 6 and 10, vectors expressing tissue-specific promoters would 
be expected to show tissue specificity in viral replication, viral spreading, cell lysis, and 
tumor suppression. 

Replication of vectors according to the invention can also be controlled by placing 

20 one or more genes essential for vector replication under the control of a promoter that is 
activated by an exogenous inducing agent, such as metals, hormones, antibiotics, and 
temperature changes. Examples of such inducible promoters include but are not limited to 
metallothionein promoters, the glucocorticoid promoter, the tetracycline response promoter, 
and heat shock protein (hsp) promoters such as the hsp 65 and 70 promoters. 

25 The invention also provides compositions comprising a recombinant vector that 

overexpresses ADP in an amount effective for promoting death of neoplastic cells and a 
method comprising administering a therapeutically effective amount of the vector to a 
neoplastic cell in a patient It is believed the compositions and methods of the present 
invention are useful for killing neoplastic cells of any origin and include neoplastic cells 

30 comprising tumors as well as metastatic neoplastic cells. 

It is also contemplated that ADP-expressing viral vectors can be administered to 
neoplastic cells along with a replication-defective vims that expresses an anti-cancer gene 
product For example, many replication-defective El" Ad vectors for use in cancer therapy 
are well characterized A limitation of replication-defective vectors is that they only 

35 synthesize the therapeutic protein in the cell they initially infect, they cannot spread to other 
cells. Also, since the genome does not replicate, transcription can only occur from the input 
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genomes, and this could be as low as one copy per cell. In contrast, the genome of 
replication-competent Ad vectors are amplified by about 10 4 in the cell that was initially 
infected, providing more templates for transcription. More amplification is achieved as the 
vector spreads to other cells. By combining replication-defective viral vectors expressing an 
5 anti-cancer gene product with replication-competent viral vectors described herein, it is 
expected that the result will be template amplification and rapid spread of both vectors to 
surrounding cells. For example, with Ad-based vectors, the burst size for each vector should 
be large, ~10 4 PFU/cell, so the probability of co-infection of surrounding cells by both vectors 
will be high. Thus, both the replication-competent and replication-defective vectors should 

1 0 spread simultaneously through the tumor, providing even more effective anti-cancer therapy. 

As an alternative method of delivering an anti-cancer gene product with an ADP 
overexpressing Ad vector, the anti-cancer gene can be engineered into any of die ADP 
overexpressing replication-competent vectors described herein, in order to provide both the 
ADP and the anti-cancer function in a single vector. The anti-cancer gene can be engineered 
' 1 5 into any appropriate location of the vector, as can be easily determined by the skilled artisan. 
For example, the anti-cancer gene can be engineered into the E3 region. 

Expression of the anti-cancer gene product encoded by the replication-defective 
vector can be under the control of either constitutive, inducible or cell-type specific 
promoters. The anti-cancer gene product can be any substance that promotes death of a 

20 neoplastic cell. The term "gene product" as used herein refers to any biological product or 
products produced as a result of the biochemical reactions that occur under the control of a 
gene. The gene product can be, for example, an RNA molecule, a peptide, a protein, or a 
product produced under the control of an enzyme or other molecule that is the initial product 
of the gene, i.e., a metabolic product. For example, a gene can first control the synthesis of an 

25 RNA molecule which is translated by the action of ribosomes into a prodrug converting 

enzyme which converts a nontoxic prodrug administered to a cancer patient to a cell-killing 
agent; the RNA molecule, enzyme, and the cell-killing agent generated by the enzyme are all 
gene products as the term is used here. Examples of anti-cancer gene products include but are 
not limited to cell-killing agents such as apoptosis-promoting agents and toxins; prodrug 

30 converting enzymes; angiogenesis inhibitors; and immunoregulatory molecules and antigens 
capable of stimulating an immune response, humoral and/or cellular, against the neoplastic 
cell. 

Apoptosis-promoting agents include but are not limited to the pro-apoptotic members 
of the BCL-2 family such as BAX, BAD, BED and BIK, as well as antisense molecules which 
35 block expression of anti-apoptotic members of the BCL-2 family. Examples of 

immunoregulatory molecules are cytokines such as tumor necrosis factor, Fas/Apo 1/CD95 
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ligand, tumor necrosis factor related apoptosis inducing ligand, interleukins, macrophage 
activating factor and interferon y. Angiogenesis inhibitors include but are not limited to 
endostatin and angiostatm. Toxins include but are not limited to tumor necrosis factor, 
lymphotoxin, the plant toxin ricin, which is not toxic to humans due to the lack of ricin 
5 receptors in animal cells, and the toxic subunit of bacterial toxins. Examples of pro-drug 
converting enzymes and pro-drug combinations are described in WO 96/40238 and include 
thymidine kinase and acyclovir or gancyclovir; and bacterial cytosine deaminase and 5- 
fluorocytosine. 

The therapeutic or pharmaceutical compositions of the present invention can be 

1 0 administered by any suitable route known in the art including for example by direct injection 
into a tumor or by other injection routes such as intravenous, subcutaneous, intramuscular, 
transdermal, intrathecal and intracerebral. Administration can be either rapid as by injection 
or over a period of time as by slow infusion or administration of slow release formulation. 
For treating tissues in the central nervous system, administration can be by injection or 

15 infusion into the cerebrospinal fluid (CSF). When it is intended that a recombinant vector of 
the invention be administered to cells in the central nervous system, administration can be 
with one or more agents capable of promoting penetration of the vector across the blood-brain 
barrier. Preferably, vectors of the invention are administered with a carrier such as liposomes 
or polymers containing a targeting moiety to limit delivery of the vector to targeted cells. 

20 Examples of targeting moieties include but are not limited to antibodies, ligands or receptors 
to specific cell surface molecules. 

Compositions according to the invention can be employed in the form of 
pharmaceutical preparations. Such preparations are made in a manner well known in the 
pharmaceutical art One preferred preparation utilizes a vehicle of physiological saline 

25 solution, but it is contemplated that other pharmaceutical^ acceptable carriers such as 

physiological concentrations of other non-toxic salts, five percent aqueous glucose solution, 
sterile water or the like may also be used. It may also be desirable that a suitable buffer be 
present in the composition. Such solutions can, if desired, be lyophilized and stored in a 
sterile ampoule ready for neconstitution by the addition of sterile water for ready injection. 

30 The primary solvent can be aqueous or alternatively non-aqueous. 

The carrier can also contain other pharmaceutically-acceptable excipients for 
modifying or maintaining the pH, osmolality, viscosity, clarity, color, sterility, stability, rate 
of dissolution, or odor of the formulation. Similarly, the carrier may contain still other 
pharmaceutically-acceptable excipients for modifying or maintaining release or absorption or 

35 penetration across the blood-brain barrier. Such excipients are those substances usually and 
customarily employed to formulate dosages for parenteral administration in either unit dosage 
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or multi-dose form or for direct infusion into the cerebrospinal fluid by continuous or periodic 
infusion. 

It is also contemplated that certain formulations containing ADP-expressing vectors 
are to be administered orally. Such formulations are preferably encapsulated and formulated 
5 with suitable carriers in solid dosage forms. Some examples of suitable carriers, excipients, 
and diluents include lactose, dextrose, sucrose, sorbitol, martnitol, starches, gum acacia, 
calcium phosphate, alginates, calcium silicate, microcrystallme cellulose, 
polyvinylpyrrolidone, cellulose, gelatin, syrup, methyl cellulose, methyl- and 
propylhydroxybenzoates, talc, magnesium, stearate, water, mineral oil, and the like. The 

1 0 formulations can additionally include lubricating agents, wetting agents, emulsifying and 
suspending agents, preserving agents, sweetening agents or flavoring agents. The 
compositions may be formulated so as to provide rapid, sustained, or delayed release of the 
active ingredients after administration to the patient by employing procedures well known in 
the art. The formulations can also contain substances that diminish proteolytic degradation 

1 5 and promote absorption such as, for example, surface active agents. 

The specific dose is calculated according to the approximate body weight or body 
surface area of the patient or the volume of body space to be occupied The dose will also be 
calculated dependent upon the particular route of administration selected. Further refinement 
of the calculations necessary to determine the appropriate dosage for treatment is routinely 

20 made by those of ordinary skill in the art. Such calculations can be made without undue 

experimentation by one skilled in the art Exact dosages are determined in conjunction with 
standard dose-response studies. It will be understood that the amount of the composition 
actually administered will be determined by a practitioner, in the light of the relevant 
circumstances including the condition or conditions to be treated, the choice of composition to 

25 be administered, the age, weight, and response of the individual patient, the severity of the 
patient's symptoms, and the chosen route of administration. Dose administration can be 
repeated depending upon the pharmacokinetic parameters of the dosage formulation and the 
route of administration used. 

The invention also contemplates passively immunizing patients who have been 

30 treated with a viral vector overexpressing ADP. Passive immunization can include 

administering to the patient antiserum raised against the viral vector, or gamma-globulin or 
vector-specific purified polyclonal or monoclonal antibodies isolated from the antiserum. 
Preferably, the patient is passively immunized after a time period sufficient for the viral 
vector to replicate in and spread through the tumor. 

35 Preferred embodiments of the invention are described in the following examples. 

Other embodiments within the scope of the claims herein will be apparent to one skilled in the 
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art from consideration of the specification or practice of the invention as disclosed herein. It 
is intended that the specification, together with the examples, be considered exemplary only, 
with the scope and spirit of the invention being indicated by the claims which foliow the 
examples. 

5 Example 1 

This example illustrates the construction and characterization of the KD1 and KD3 
anti-cancer vectors. 

To construct KD1, the inventors deleted the entire E3 region of a unique plasmid, 
leaving behind only a unique PacI site for cloning. The starting plasmid was pCRJQ\ 

1 0 purchased from Invitrogen, containing the Ad5 BamHIA fragment having a deletion of all the 
E3 genes; the E3 deletion is identical to that for KD1 and GZ3, the sequences of which are 
given in SEQ ED NO: 1 and SEQ ID NO:4, respectively. The ADP gene from Ad5 was cloned 
into the PacI site, then built into the E3 region of the genome of the Ad5 El A mutant named 
J/01/07. This was done by co-transfecting into human embryonic kidney 293 cells the 

1 5 aforementioned BamHIA fragment containing the ADP gene together with the overlapping 
EcoRIA restriction fragment obtained from dlO 1/07. Complete viral genomes are formed 
within the cell by overlap recombination between the Ad sequences in the BamHIA fragment 
in the plasmid and the EcoRIA fragment KD3 was constructed in the same way except the 
E3 gene for the 1 2.5K protein was retained in the starting plasmid. A vector named KD2, 

20 which marginally overexpress ADP, was also prepared. Plaques of each recombinant Ad 
were picked, screened, purified, expanded into CsCl-banded stocks, sequenced, titered, and 
characterized. GZ1 and GZ3 are Ad vectors that are identical to KD1 and KD3, respectively, 
except that GZ1 and GZ3 have wild-type El A sequences as found in AD5 or in the Ad5 
mutant <//309. GZ1 and GZ3 were constructed as described for KD1 and KD3 except that the 

25 EcoRIA fragment of Ad5 was used for GZ1 and GZ3. 

KD1 and KD3 were characterized in cell culture by infecting the human A549 lung 
carcinoma cell line with high titer (1-8 x 10 10 plaque forming units [PFU] per ml) virus stocks 
of one of these recombinant vectors, or with one of the control viruses <#01/07, <#309, d/327, 
and Ad5 (wt). Fifty PFU per cell were used for each virus. The descriptions of these viruses 

30 as well as some other viruses used in these examples are presented in Table 1. 
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Using a polymerase chain reaction (PCR)-based protocol, an in-frame stop codon was 
introduced into the gene for the E3-gpl9K protein in the E3 region of the Ad5 mutant dl309 
(Jones and Shenk, Cell 77:683-689, 1979). The mutagenesis was conducted using a Sunl- 
Bstl 1071 fragment, nucleotides 28,390 to 29,012 in the Ad5 genome, which was then 
5 substituted for the equivalent fragment in J/309. dI0l/07 is the parent for KD1 and KD3. In 
turn, the Ad5 mutant named J/309 is the parent of J/01/07, i.e. J/309 is identical to J/01/07 
except that J/309 does not have the El A mutation. Both J/01/07 and J/309 have deletions of 
the genes for the E3RDDa,RIDp and 14.7K proteins but retain the gene for ADP. The Ad5 
mutant J/327 has wild-type El A, it lacks the gene for ADP, and its lacks all other E3 genes 

1 0 except the one for the 12.5K protein. 

At 24 and 36 hours post-infection (h p.i.), proteins were extracted from the A549 cells 
and analyzed for ADP by immunoblot using a rabbit antiserum against ADP (Tollefson et ah, 
J. Virol 66:3633-3642, 1992). The results are shown in Figure 2. Much more ADP was 
detected at 24 and 36 hp.i. in KD1- and KD3 -infected cells than in cells infected with 

1 5 J/0 1/07. Also, much more ADP was synthesized by GZ1 and GZ3 than J/309 or the other 
viruses. Most importantly, KD1, KD3, GZ1, and GZ3 expressed much more ADP at 24 h p.i. 
than did J/01/07 or J/309 (Fig. 2). This result is consistent with an observation discussed 
below that the cells infected with KD1 , KD3, GZ1 , or GZ3 lyse fester, and that these viruses 
spread from cell to cell faster than J/01/07 or J/309. It is noteworthy that KD1, KD3, GZ1, 

20 and GZ3 express much more ADP at 24 and 36 h p.i. than the Ad5 mutant J/1520 (Fig. 2); 
J/1520 is the original name given to ONYX-015 (Heise et al., Nature Medicine 5:639-645, 
1 997). As expected, no ADP was detected in cells infected with /wn734. 1 (Fig. 2), a mutant 
that lacks amino acids 1 to 48 in ADP (Tollefson et al., /. Virol. 70:2296-2306, 1996). 
Expression of the El A proteins by J/01/07, KD1, KD2, and KD3 was slightly less than by 

25 Ad5, J/309, or J/327, and as expected from the J/01/07 deletion, the proteins were smaller 
(Fig. 3 A). J/327 is isogenic with J/324 (Thimmappaya et al., 1982 Cell 57:543-51, 1983), 
and it lacks the gene for ADP and all other E3 proteins except the 12.5K protein. 

The amount of ADP detected in the KD1 and KD3 infected cells is significantly 
higher than the amount detected in the J/309 infected cells (Fig. 2). If one takes into 

30 consideration the fact that the viruses with the El A mutation replicate somewhat slower, as 
evidenced in by the delayed appearance of the late proteins (Fig. 3B), it is clear that KD1 and 
KD3 express much more ADP per viral genome present in the cell than J/309. This finding is 
supported by the feet that when A549 cells are coinfected with a virus containing the El A 
mutation and J/327, which lacks ADP but has wild-type El A, the replication rates of die E1A 

35 mutant viruses speed up, as indicated by earlier appearance of late proteins (compare Figs. 3B 
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and 3D). Thus, dllll complements the El A mutation. In conclusion, these experiments 
demonstrate that ADP is dramatically overexpressed by KD1, KD3, GZ1, and GZ3. ADP is 
marginally overexpressed by KD2 (not shown). 

Example 2 

5 This example illustrates that KD1 and KD3 lyse cells more rapidly and spread from 

cell-to cell faster than other adenoviruses. 

The ability of KD1 and KD3 to lyse cells was examined by a trypan blue exclusion 
cell viability assay which was performed essentially as described by Tollefson et al., 7. Virol 
70:2296-2306, 1996. In brief, A549 cells were mock-infected or infected with 20 PFU/cell of 

10 KD1, KD3, d/01/07, dlhll or dim: At various days p.i., the number of viable cells was 
determined using a hemocytometer (600 to 1000 cells were counted per time point) and the 
results are shown in Fig. 4. 

Only 25% of the KDl-infected cells and 9% of the KD3-infected cells were alive at 5 
days p.i. as compared to 44% of cells infected with d/01/07, which has the same E1A 

1 5 mutation as KD 1 and KD3. The KD 1 and XD3 vectors also lysed cells fester than d/309, 

which has a wild-type El A region. When infected with rf/327 (ADF, El A*), 94% of the cells 
were alive after 5 days. When cell lysis was estimated by release of lactate dehydrogenase, 
KD1 and KD3 once again lysed cells faster than d/01/07 and tf/309, and dBll caused little 
cell lysis (data not shown). Thus, ADP is required for efficient cell lysis, and over-expression 

20 of ADP increases the rate of cell lysis. 

As another means to measure cell lysis and to examine virus replication in cancer 
cells, separate groups of A549 cells were infected with 20 PFU/cell of KD1, KD3, d701/07, or 
<//309 and the amount of intracellular and extracellular virus was determined by plaque assay 
on A549 cells. At 2 days p.i., the total amount of vims formed in each group was similar, 2-4 

25 x 10 8 PFU/ml, indicating that replication of all the viruses is similar. However, when the ratio 
of extracellular to intracellular virus was calculated, the value for KD1 and KD3 was 2-3 logs 
higher than for Ad5, d/309, or dIOl/01 (data not shown). Thus, virus is released much mare 
rapidly from cells infected with KD1 and KD3, which overexpress ADP, than with viruses 
expressing wild-type amounts of ADP. 

30 The ability of KD1 and KD3 to spread from cell-to-cell was measured in a "cell 

spreading" assay. In this assay monolayers of A549 cells in a 48 well culture dish were mock- 
infected or infected with 10 3 , 10" 2 , 10*', 10°, or 10 PFU/cell of d/327, dl3Q9 f Ad5, rf/01/07, 
KD1 or KD3. At low PFU/cell, the viruses must go through two or three rounds of 
replication in order to infect every cell in the monolayer. At 1 .0 and 10 PFU/cell, the 

35 monolayer should be destroyed by the virus that initially infected the cells. To assess die 
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amount of spread in the monolayers by 7 days p.i., crystal violet, which stains live cells but 
not dead cells, was added to the monolayers. The results are shown in Fig. 5. 

Remarkably, at 7 days p.i., the monolayer was virtually eliminated by KD1 and KD3 
at 10° PFU/cell, whereas 1.0 PFU/cell was required with rf/01/07, J/309 and Ad5. This result 
5 attests to the potency of ADP in mediating cell lysis and virus spread in A549 cells. KD1 and 
KD3 are also more effective that dlO 1/07 in killing other types of human cancer cell lines 
(most purchased from the American Type Culture Collection [ATCC]) as determined in this 
cell spreading assay. KD1 and/or KD3 killed HeLa (cervical carcinoma), DU145 (prostate), 
and pC3 (prostate) cells at 10" 2 PFU/cell, ME-180 (cervix) and Hep3B (liver) at 10" 1 PFU/cell, 

10 and Ul 18 (glioblastoma) and U373 (glioblastoma) at 10 PFU/cell. From 10- to 100-fold 
more rf/01/07 was required to kill these cells (data not shown). These results indicate that 
KD1 and KD3 may be effective against many types of cancer. 

An important aspect of the finding that ADP overexpressing vectors lyse cells at very 
low multiplicities of infection is that the multiplicity of infection in human tumors is likely to 

15 be low at sites distal to the sight of vector injection or distal to blood vessels that cany the 
vector to the tumor. Thus, ADP overexpressing vectors have an advantage over vectors that 
express less ADP or no ADP at all. 

Example 3 

This example illustrates that KD1 and KD3 replicate poorly in non-growing non- 
20 cancerous cells. The replication phenotype of KD1 and KD3 was evaluated using "normaF 
HEL-299 human fibroblast cells, either growing in 10% serum or rendered quiescent using 
0.1% serum. All Ads should replicate well in growing cells, but viruses with the dlOl/07 El A 
mutation should do poorly in quiescent cells because El A is required to drive them out of G 0 . 
J/309, which has wild-type El A, should replicate well in both growing and growth-arrested 
25 cells. 

Cells were infected with 100 PFU/cell of KD1, KD3, #01/07, or d/309. At different 
days p.i., virus was extracted and titered. In 10% serum, KD1, KD3, and dlOl/07 replicated 
well, reaching titers of 10 6 -10 7 PFU/ml, only slightly less than rf/309 (Fig. 6). However, in 
quiescent cells, replication of KD1, KD3, and rf/01/07 was 1.5-2 logs lower than in growing 

30 cells, ranging from 10 4 to 2 x 10 5 PFU/ml. The titer of d/309 reached 10 7 PFU/ml, nearly the 
level achieved in growing cells. At 10 days p.i., quiescent HEL-299 cell monolayers infected 
with 100 PFU/cell of KD1 , KD3, or rf/01/07 were intact, whereas those infected with J/309 or 
<fl327, which have wild-type E1A, showed strong typical Ad cytopathic effect indicative of 
cell death (data not shown). Thus, replication of KD1 and KD3 is severely restricted to 

35 growing cell lines. 



WO 01/04282 



PCT/USOO/18971 



29 

The restriction associated with the dlOXIQl El A mutation was also tested in primary 
human cells (purchased from Clonetics) growing as monolayers. Bronchial epithelial cells 
(Fig. 7) and small airway epithelial cells were not killed by 10 PFU/cell of KD1, KD3> or 
rf/01/07 at 5 days p.i., whereas they were killed by 10 PFU/cell of <//309 or d/327 (data not 
5 shown). Lung endothelial cells also were not killed after 10 days by KD1, KD3, or dlOl/07 at 
10 PFU/cell, but they were killed by 1 PFU/cell of <U309. These monolayers were 
subconfluent when initially infected, then grew to confluency. The exciting result here is that 
although these primary cells were growing, they did not support replication in this time frame 
and were not killed by KD1 or KD3. Thus, it is believed these vectors will be restricted to 
1 0 cancerous cells, and will have little to no effect on cells such as basal cells that are normally 
dividing in the body. In addition, it is unlikely that KD1 and KD3 will affect dividing 
leukocytes because such cells are poorly infected by Ad. 

In summary, the above experiments demonstrate that KD1 and KD3 lyse cancer cells, 
spread from cell-to-cell rapidly, and replicate poorly in quiescent and non-cancerous cells. 
15 These properties should make them useful in anti-cancer therapy. 

Example 4 

This example illustrates that KD1 and KD3 inhibit the growth of human tumors in an 
animal model. 

We could not evaluate mouse or rat tumors in normal mice or rats because they are 

20 totally non-permissive. Human cancer cell lines growing in nude mice have been used by 

Onyx Pharmaceuticals (Richmond, CA) to evaluate the efficacy of ONYX-015, an Ad vector 
lacking expression of the E1B 55 kDa protein (Heise et al., Nature Med. 5:639-645, 1997). 
We have found that A549 cells, which were used in many of our cell culture studies, form 
excellent rapidly growing solid tumors when injected subcutaneously into nude mice. The 

25 average tumor reaches ca. 500 ul in four weeks, and is encapsulated, vascularized, and 
attached to the mouse skin (usually) or muscle. 

Nude mice were inoculated into each hind flank with 2 x 10 7 A549 cells. After 1 
week tumors had formed, ranging in size from about 20 ul to 50 ul. Individual tumors were 
injected three days later, and at subsequent weeks for 4 weeks (total of 5 injections), with 50 

30 ul of buffer or 50 ul of buffer containing 5 x 10 7 PFU of dim, <£«)l/07, KD1, KD3, or 

pm734.1, with a total virus dose per tumor of 3 x 10 8 PFU. The mutant pmlZAA lacks ADP 
activity due to two nonsense mutations in the gene for ADP, but all other Ad proteins are 
expected to be synthesized at wild-type levels (Tollefson et al., Jl Virol 70:2296-2306, 1996). 
The efficacy of each virus (or buffer) was tested on six tumors. At weekly intervals, the 

35 length (L) and width (W) of tumors were measured using a Mitutoyo digital caliper. Tumor 
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volumes were calculated by multiplying L x W x W/2. This value was divided by the tumor 
volume at the time of the initial virus injection, the fold-increase in tumor growth was 
calculated, and the average for the six tumors was graphed. 

As shown in Fig. 8A, tumors that received buffer continued to grow, increasing about 
5 14-fold by 5 weeks. In contrast, tumors injected with J/309, which expresses normal amounts 
of ADP and lacks the E3 RID and 14.7K and proteins, only grew about 23-fold by 5 weeks. 
With /7to734.1, which lacks ADP, the tumors grew as well as those that received buffer. 
Thus, J/309 markedly decreases the rate of tumor growth, and ADP is required for this 
decrease. Tumors inoculated with J/01/07 grew about 8-fold over 5 weeks. Since J/01/07 is 
10 identical to J/309 except for the El A mutation, this result indicates that the El A mutation 
significantly reduces the ability of Ad to prevent growth of the tumors. This effect is 
probably due to a reduction in virus replication in the tumors resulting in lower ADP 
expression, but it could also reflect other properties of El A in the tumor cells, e.g. the 
inability of the mutant El A proteins to induce apoptosis. Most importantly, tumors 
1 5 inoculated with KD 1 or KD3 only grew about 23-fold. Thus, the overexpression of ADP by 
KD1 and KD3 allows KD1 and KD3 to reduce tumor growth to a rate markedly slower than 
J/01/07 (their parental control virus), and even to a rate similar to that of J/309. 

The finding that KD1 and KD3 are as effective as wild-type Ad (i.e. J/309) in 
reducing the rate of A549 tumor growth is highly significant in the context of cancer 
20 treatment, inasmuch as KD1 and KD3 are restricted to cancer cells whereas wild-type Ad 
does not have such a restriction. 

The tumors in Fig. 8A received five injections of vectors, but only one dose of vector, 
in this case 5 x 10* of each of KD3 or GZ3, is sufficient to significantly reduce the rate of 
A549 tumor growth (Fig. 8B). 
25 We have also found that KD1 and KD3 reduce the rate of growth in nude mice of a 

human liver cancer cell line, Hep3B cells. These cells form rapidly growing tumors that are 
highly vascularized. Nude mice were inoculated into each hind flank with 1 x 10 7 of Hep3B 
cells. After tumors reached about 100 fil, they were injected twice per week for 3 weeks with 
50^1 of buffer or 5 x 10 7 PFU of KD1, KD3, or J/309. There were typically 8-10 tumors per 
30 test virus. The tumor sizes were measured and the fold increase in size at 0 to 3.5 following 
the initial virus injection was graphed as described above for the A549 tumors. Tumors that 
received buffer alone grew 9-fold over 3 weeks and were projected to grow about 12-fold 
over 3 J weeks (after 3 weeks the mice had to be sacrificed because the tumors were 
becoming too large) (Fig. 9). Tumors that received KD1 or KD3 grew about 4-fold, 
35 establishing that KD1 and KD3 reduce the growth of Hep3B tumors in nude mice. Tumors 
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that were injected with J/309 grew 2-fold (Fig. 9). The finding that KDl and KD3 were 
somewhat less effective than J/309 is probably due to the fact that they do not grow as well as 
J/309 in Hep3B cells, as indicated by a cell spread assay in culture (data not shown). In any 
case, the important points are that KDl and KD3 are effective against the Hep3B tumors, and 
5 that they contain the £ 1 A mutation that limits their replication to cancer cells. 

These results point to the potency of ADP as an anti-tumor agent when expressed in 
an Ad vector. It is highly probable that KDl and KD3 will provide significant clinical benefit 
when used to infect tumors growing in humans. 

Examples 

1 0 This example illustrates the use of replication-defective Ad vectors in combination 

with KDl or KD3. 

It is well established that replication-competent (RQ viruses complement replication- 
defective (RD) mutants. That is, when the same cell is infected, the competent virus will 
supply the protein(s) that cannot be made from the mutant genome, and both viruses will 

1 5 grow. To test the ability of KDl and KD3 to complement RD viruses, two RD vectors 
expressing (3-galactosidase were constructed The first, named Ad-P-gal, has a cDNA 
encoding (3-gal under the control of the Rous Sarcoma Virus promoter substituted for the 
deleted El region. Ad-P-gal also has the E3 region deleted, including the gene for ADP. The 
second, named Ad-p-gal/FasL is identical to Ad-P-gal, except that it also expresses murine 

20 FasL from the human cytomegalovirus promoter/enhancer. These vectors were constructed 
by overlap recombination in human 293 cells that constitutively express the Ad E1A and E1B 
genes and complement replication of the El-minus vectors. 

These RD vectors should infect and express P-gal in A549 cells, but should not 
replicate because the El A proteins are lacking. However, the vectors should replicate when 

25 cells are co-infected with RC Ads. To prove this, A549 cells were infected with 10 PFU/cell 
of Ad-P-gal alone, or with 10 PFU/cell of Ad-p-gal plus 10 PFU/cell of KDl, KD3, J/01/07, 
J/309, or J/327. At 2 days p.i., virus was extracted and Ad-p-gal titers determined by P-gal 
expression in A549 cells. The yields are shown in Table 2 below. 
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Table 2 



Virus 


Yield 
(blue plaques per ml) 


Ad-P-gal 


lxlO 2 


Ad-0-gal + KDl 


2xl0 5 


Ad-P-gal + KD3 


3 x10 s 


Ad-p-gal + d/01/07 


4xl0 4 


Ad-0-gal + <ff3O9 


3xl0 5 


Ad-p-gal + d/327 


3.0 x10 s 



The data in Table 2 indicate that the complementing viruses increased the yield of Ad-P-gal 
by about 10 3 . 

5 A key feature of KD1 and KD3 is that they spread from cell-to-cell faster than other 

Ads. Accordingly, they should complement the spread of Ad-P-gal. To test this, an 
infectious center assay was conducted. A549 cells were infected with Ad-P-gal plus KD1, 
KD3, or dlO 1/07. After 2 h, cells were collected, diluted, and seeded onto monolayers of 
fresh A549 cells. After 4 days, the cells were stained with X-gal and the results are shown in 
10 Fig. 10. 

With Ad-P-gal alone, only the originally infected cell (before seeding) should be 
stained, and the vector should not spread to other cells on the seeded monolayer. This was 
indeed the case. In monolayers seeded with A549 cells infected with Ad-P-gal alone (dish 
shown in the top left of Fig. 10A) contained a number of individual blue cells (not visible in 

15 the print); examples are shown in the enlarged view Fig. 10B. However, when the 

monolayers were seeded with A549 cells coinfected with Ad-p-gal and KD1 or KD3, there 
were numerous "comets" of blue cells (Fig. 10A). Each comet represents Ad-P-gal which has 
spread from one initially-infected cell. Most of the cells within a comet were stained with X- 
gal (Fig. 10C). Comets were also observed with J/01/07, but not to the extent of KD1 and 

20 KD3 (Fig. 10A). With rf/327 (ADF), there was little spread from the originally infected cell 
(data not shown). In summary, KD1 and KD3 not only complement the replication of Ad-P- 
gal, they also enhance its rapid spread. 

It is expected that KDl and KD3 will also complement and enhance the spread of RD 
vectors expressing anti-cancer therapeutic gene products, and this expectation can be readily 
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verified using the Ad-P-gal/FasL in replication and infectious center assays as described 
above. 

KD1 and KD3 not only complement the replication of RD vectors in cell culture, they 
also do so in Hep3B tumors growing in the hind flanks of nude mice. The RD vector used 
5 was AdLuc, an Ad that lacks the El and E3 regions, and has inserted into the El region an 
expression cassette where the firefly luciferase gene is expressed from the Rous Sarcoma 
Virus promoter (Harrod et al., Human Gene Therapy9\ 1885- 1898, 1998). The Hep3B tumors 
were injected with 1 x 10 7 PFU of AdLuc plus buffer, or 1 x 10 7 PFU of AdLuc plus 5 x 10 7 
PFU of KD1, KD3, d/01/07, or d/309. After 2 weeks, mice were sacrificed and tumors 

1 0 excised. Proteins were extracted from the tumors and luciferase activity determined using a 
luminometer. The luciferase counts per tumor were 6,800 for AdLuc plus buffer, 1 13,500 for 
KD1, and 146,900 for KD3 (Fig. 1 1). Thus, KD3 and KD1 respectively caused a 22-fold and 
1 7-fold increase in luciferase activity. This increase could be due to elevated synthesis of 
luciferase in cells that were initially coinfected the AdLuc and KD1 or KD3, and it could also 

1 5 be due to spread of AdLuc from cell to cell in the tumor as mediated by KD1 or KD3. 

In summary, infecting a tumor with a replication-competent ADP-overexpressing 
vector according to the invention together with a RD vector expressing an anti-cancer gene 
product should greatly increase the amount of anti-cancer protein synthesized in the tumor 
thereby increasing the ability of the replication-defective vector to promote destruction of the 

20 tumor. 

Example 6 

This example illustrates the construction and characterization of a recombinant Ad 
vector according to the invention which is replication-restricted to cancerous type II alveolar 
cells. 

25 As demonstrated above, the J/01/07 mutation in KD1 and KD3 limits growth of these 

vectors to cancer cells. To further restrict their replication phenotype, the E4 promoter in 
each virus was deleted and replaced by the surfactant protein B (SPB) promoter to produce 
vectors named KD1-SPB (SEQ ID NO: 14), KD3-SPB (SEQ ID NO: 15), and J/01/07-SPB 
(SEQ ID NO:16). The SPB promoter is only active in cells containing the TTF1 transcription 

30 factor, which has thus far been found primarily in type II alveolar cells of the human lung 
(Lazzaro et al., Development 7/5:1093-1 104, 1991). Thus, KDl-SPB, KD3-SPB, and 
J/01/07-SPB should be severely restricted to cancerous type II alveolar cells of the human 
lung. Many lung cancers are of this type. 

The KDl-SPB and KD3-SPB vectors were prepared as follows. The E4 promoter is 

35 located at the right end of the Ad genome (Fig. 1). Using a pCRII-based plasmid (Invitrogen) 
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containing the Ad5 DNA sequences from the BamHI site (59 map units) to the right hand end 
of the genome, and using and a PCR-based protocol, nearly all the transcription factor binding 
sites were deleted from the E4 promoter Ad5 base pairs 35,623 to 35,775 and replaced with a 
500 base pair fragment containing the SPB promoter (Yan et aL, /. Biol. Chem. 27024852- 
5 24857, 1 995). The final plasmids contain the E4-SPB substitution in the E4 region and die 
dlQMQly KD1, or KD3 versions of the E3 region, respectively, for the viruses rf/01/07-SPB, 
KD1-SPB, and KD3-SPB. These plasmids were co-transfected into 293 cells with a fragment 
containing the left portion of the genome of rf/01/07, and plaques were allowed to develop. 
Plaques were screened for the expected features, purified, then expanded into a stock. 

1 0 The A549-TTF1 cell line was developed in order to test the prediction that replication 

of J/01/07-SPB, KD1-SPB, and KD3-SPB would be restricted to cancerous cells expressing 
the TTF1 transcription factor. These cells were co-transfected with two plasmids, one in 
which TTF1 is expressed from the CMV promoter, and the other coding for resistance to 
neomycin Resistant clones were isolated and shown to express li t I activity as determined 

15 by transient transfection with a plasmid expressing chloramphenicol acetyltransferase from 
the TIT 1 -requiring surfactant protein C promoter. 

KD1-SPB and KD1 were subjected to a standard plaque development assay on A549- 
TTF1 cells and parental A549 cells. The results are shown in Fig. 12. With KD1-SPB on 
A549 cells, plaques were not visible after 8 days, only about 4% of the final number of 

20 plaques were seen after 10 days, and about 50% of final plaques were seen after 12 days. 
With KD1-SPB on A549-TTF1 cells, plaques were visible after 6 days, and about 60% of 
plaques were seen after 10 days. Thus, as expected, KD1-SPB grew significantly faster on 
the cells containing TTF1 . KD1 formed plaques more quickly than KD1-SPB on both A549 
and A549-TTF1 cells, indicating that the E4 promoter-SPB substitution is not as effective the 

25 wild-type E4 promoter in inducing Ad replication. However, this difference between KD1- 
SPB and KD1 on A549-TTF1 cells is tolerable, with KD1-SPB delayed only about 1 day. 
Curiously, the final titer obtained for all virus stocks by day 16 was similar, indicating that 
A549 cells may contain a very small amount of endogenous TTF1 activity. It is predicted that 
KD3-SPB and J/01/07-SPB will behave similarly to KD1-SPB when grown in A549-TTF1 

30 cells and A549 cells. 

The restriction of KD1-SPB to cells containing TTF1 was further examined in a cell 
spread assay using H441 cells, a TTF1 -expressing human pulmonary adenocarcinoma cell 
line (Yan et al., supra), and Hep3B cells, a liver cancer cell line not expected to express 
TTF1 . Culture dish wells containing H441 or Hep3B cells were infected with KD1-SPB or 

35 KD1 at multiplicities ranging from 10 to 10 -1 PFU/cell. The H441 and Hep3B cells were 
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stained with crystal violet at 5 days and 8 days p.i., respectively. KD1-SPB and KD1 grew 
and spread equally well on H441 cells, causing destruction of the monolayer at 10" 1 PFU per 
cell (Fig. 13). (Some of the H441 monolayer has peeled off in the well with KD1-SPB at 10~ 2 
PFU per cell, and in the wells with KD1 and KD1-SPB at 10" 4 PFU per cell; mis occasionally 
5 occurs in cell spread assays, and it does not reflect virus infection). With Hep3B cells, KD1 
grew and spread very much better than KD1-SPB, with 10" 2 PFU per cell of KD1 causing 
more destruction of the monolayer as 1 .0 PFU per cell of KD1-SPB (Fig. 13). 

In summary, this example demonstrates that a replication-competent Ad, which 
replicates well on cells expressing the appropriate transcription factor, can be constructed 

1 0 with a tissue-specific promoter substituted in place of the E4 promoter. This methodology 
should be applicable to many other tissue specific and cell type specific promoters. One 
possibility would be a liver-specific promoter. Another possibility would be to use the E2F 
promoter, or another promoter with E2F sites, inasmuch as that promoter would be active 
only in cells such as cancer cells that have free E2F. A third possibility would be to use a 

1 5 regulatable promoter, e.g. the synthetic tetracycline response promoter (Massie et al., 7. ViroL 
72.2289-2296, 1998), where the activity of the promoter is controlled by the level of 
tetracycline or a tetracyclin analog in the patient 

Example 7 

This example illustrates the construction and characterization of vectors which 

20 overexpress ADP and are not replication restricted. 

As demonstrated above, the dlOl/Ql El A mutation in KD1 and KD3 is attenuating, 
inhibiting growth in non-dividing and even in dividing primary human epithelial and 
endothelial cells. Ads with this mutation are able to replicate well in dividing cancer cells. 
However, replication of such El A mutants is not as efficient as, e.g. J/309 which has a wild- 

25 type El A gene. For instance, the rate of replication of J/01/07, as determined by the rate at 
which plaques develop, is reduced such that J/01/07 plaques appear one day later than those 
of J/309 (data not shown). This delay is due in part to a delay in expression of Ad late genes 
(see Fig. 3). The idea that the dIO 1/07 mutation retards the rate of replication in A549 cells is 
further supported by the data in Fig. 8A, where J/01/07 did not prevent tumor growth nearly 

30 as well as J/309. Despite this negative effect of the J/01/07 E1A mutation, there are 

theoretical and practical aspects of having this mutation in the KD1 and KD3 vectors, as has 
been discussed. Nevertheless, one can easily imagine scenarios (e.g. patients with terminal 
cancer) where the ability of an Ad vector to destroy the tumor supercedes the requirement that 
the vector be totally restricted to tumor cells. In such cases, it would be advantageous to have 

35 vectors similar to KD1 and KD3, but with the wild-type El A gene. The rates at which such 
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vectors express their genes, lyse cells, and spread from cell to cell should be higher than those 
of KD1 and KD3. Such vectors might cause some damage to non-cancerous cells and tissue, 
but this is also true for other modes of anti-cancer treatment such as surgery, chemotherapy, 
and radiation therapy. 

5 In light of these considerations, vectors named GZ1 and GZ3 have been constructed 

that are identical to KD1 and KD3, respectively, except they have a wild-type El A region. 
These vectors were constructed by overlap recombination in A549 cells. The left hand 
fragment contained the wild-type El A region of Ad5, and the right end fragment contained 
the E3 modifications of KD1 or KD3. Plaques were picked, analyzed for the expected 

10 genotype, plaque-purified, and expanded into CsCl-banded stocks. The titers of these stocks 
on A549 cells were 2.9 x 10 10 PFU/ml for GZ1 and 1.6 x 10 11 PFU/ml for GZ3. Thus, these 
vectors can be grown into high titer stocks comparable to wild-type Ad. The GZ1 and GZ3 
plaques are larger and appear much sooner than the plaques for d/309. Large rapidly- 
appearing plaques reflect the ability of Ad to lyse cells and spread from cell-to-cell (Tollefson 

15 et aL, J. Virol 70:2296-2306, 1996; Tollefson et aL, Virology 220:152-162, 1996), and this 
property, as discussed, is due to the function of ADP. 

The rate of plaque appearance can be quantitated in a plaque development assay 
(Tollefson et aL, supra). Here, a typical plaque assay is performed, and the plaques observed 
on subsequent days of the assay are calculated as a percentage of the number of plaques 

20 observed at the end of the plaque assay. As shown in Fig. 14, after 4 days of plaque assay on 
A549 cells, GZ1 and GZ3 had 48% and 34%, respectively, of the final number of plaques, 
whereas J/309 had only 1%. It is very unusual in Ad plaque assays in A549 cells for plaques 
to appear after only 4 days. These large plaques reflect the overexpression of ADP. These 
GZ1 and GZ3 plaques appear sooner than those of KD1 and KD3 (data not shown), no doubt 

25 because GZ1 and GZ3 replicate faster because they have a wild-type El A region. 

GZ1 and GZ3 lyse cells and spread from cell to cell much more effectively than 
J/309. At 6 days p.i. of A549 cells, approximately as much monolayer destruction was 
observed with GZ1 and GZ3 at 10 3 PFU per cell as was observed with d/309 at 10" 1 PFU per 
cell (Fig. 15, top panel). This result further underscores the conclusion mat overexpression of 

30 ADP promotes cell lysis and virus spread. 

In theory, GZ1 and GZ3 should be able to replicate not only in tumor cells but also in 
normal cells. Although they can replicate in normal cells, it is quite possible that GZ1 and 
G23 may be useful as anti-cancer vectors. First, GZ1 and GZ3 could be injected directly into 
the tumor. Many tumors are self-contained (encapsulated) except for the blood supply. The 

35 physical barriers of the tumor could minimize dissemination of the virus to other tissues. 
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Second, Ads are in general quite benign. Most infections of Ad5 are in infants and result in 
mild or asymptomatic disease, and are held in check by strong humoral and cellular 
immunity. Anti-Ad immunity appears to be life-long. GZ1 and GZ3 could be used only in 
patients who have an intact immune system, and perhaps also with pre-existing anti-Ad 
5 immunity. Further, patients could be passively immunized against Ad, using gamma-globulin 
or even specific purified anti-Ad neutralizing antibodies. Third, considering that Ad5 is a 
respiratory virus which most efficiently infects lung epithelial cells displaying the specific 
Ad5 receptor (named CAR) as well as specific mtegrins (e.g. avb5), replication-competent 
vectors derived from Ad5 may not spread efficiently in many non-cancer tissues of the body. 
10 In addition, it is believed that versions of GZ 1 and GZ3 can be constructed that have the E4 
promoter substituted with a tumor-specific, tissue-specific, cell-specific, or synthetic 
promoter. Such vectors would have the positive features associated with wild-type El A and 
ADP, and yet be replication-restricted to tumor tissue and/or to particular cell types. 

Example 8 

1 5 This example illustrates that the combination of KD 1 , KD3, GZ 1 , or GZ3 with " 

radiation is more effective in destroying A549 cells, growing in culture or growing as tumors 
in nude mice, than the vectors alone or radiation alone. 

This was shown in a cell spread assay. A549 cells growing in three 48 well culture 
dishes were mock-infected or infected with different viruses at multiplicities of infection 

20 rangingfrom lOto lO^PFU per cell as indicated in Fig. 15. One dish was not radiated. A 
second dish received 600 centrigreys (cGy) of radiation at 24 h p.i., and a third dish received 
2000 cGy of radiation at the same time. All dishes were stained with crystal violet at 6 days 
p.i. With the cells that were not radiated (top panel in Fig. 15), KD1 and KD3 caused 
monolayer destruction at lower multiplicities of infection than their parental control, J/01/07. 

25 This was also true for GZ1 and GZ3 as compared to their parental control J/309. (The 

paucity of cells in the cells infected with GZ1 or GZ3 at 10 - * PFU per cell is an experimental 
artifact, and is not caused by infection by GZ1 or GZ3). These KD1, KD3, GZ1 and GZ3 
results are consistent with earlier results showing that overexpression of ADP leads to 
increased cell lysis and virus spread. 

30 With the dish that was infected then radiated with 600 cGy there was markedly 

increased cell killing and virus spread as compared to the non-radiated cells (compare the 
bottom panel of Fig. 15 with the top panel). For example, with KD1, KD3, GZ1, and GZ3 
there was about the same amount of cell destruction in the radiated wells at 10* 4 PFU per cell 
as in the non-radiated wells at 10" 2 PFU per cell. Similar results were seen with the dish that 
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received 2000 cGy of radiation (data not shown), and also with dishes that received 600 or 
2000 cGy of radiation 24 h prior to infection (data not shown). 

The amount of cell destruction was quantitated by extracting the crystal violet from 
the cells with 33% acetic acid, then measuring the absorbance at 490 nrn (data not shown). 
5 The absorbance with non-radiated mock-infected cells was set at 100% cell viability. With 
mock-infected cells that received 600 cGy there was a 15% loss in viability (i.e. 15% less 
crystal violet was extracted). With KD1 at 10" 3 PFU per cell, the non-radiated cells were 80% 
viable whereas the cells receiving 600 cGy of radiation were only about 30% viable. Similar 
differences in viability between radiated and non-radiated cells were seen with KD3, GZ1, 
10 and GZ3. These results argue that the combination of radiation plus vector has a syngergistic 
effect on cell lysis and vector spread, rather than an additive effect. If the effect were only 
additive, then with the KD1 samples at 10* 3 PFU per cell, the cell viability should have been 
65% (15% reduction in viability due to radiation alone, 20% reduction due to KD1 alone). In 
fact, the cell viability was 30% rather than 65%. 
1 5 As mentioned, approximately as much cell lysis and virus spread were observed with 

600 cGy as with 2000 cGy. To determine the optimal dose of radiation to synergize with the 
vectors, an experiment similar to the one described above was conducted with mock-, 
J/01/07-, KD1-, KD3-, d/309, GZ1-, or GZ3-infected A549 cells. The 48 well plates received 
0, 150, 300, or 600 cGy of radiation at 24 h p.i. Cells were stained with crystal violet The 
20 results with cells receiving 0 versus 600 cGy of radiation were similar to those in Fig. 15. 
The crystal violet was extracted from the cells infected with 10 3 PFU per cell of the 
difference viruses. The absorbance of crystal violet was determined, and the percent cell 
viability was graphed, using the absorbance of the non-radiated mock-infected cells as 100% 
cell viability. As illustrated in Fig. 16, an approximately linear decrease in cell viability in all 
25 wells was obtained with increasing radiation dose, although the slope of the line was more 
negative with KD1, KD3, GZ1, or GZ3 than with mock, J/01/07, or rf/309. With KD1, KD3, 
GZ1, and GZ3, there was much more cell lysis and vector spread with their parental control 
viruses, and there was synergy between the vectors and radiation. For example, with mock- 
infected cells, 600 cGy reduced cell viability by about 30% (70% of cells were viable). KD1 
30 without radiation reduced cell viability by about 23%. The combination of 600 cGy radiation 
plus KD1 reduced cell viability to about 85%, more than 53% of which is the sum of radiation 
alone and KD1 alone. When considering the data in Figs. 15 and 16 together, a dose of about 
600 cGy is optimal in this type of cell culture experiment 

The combination of KD3 or GZ3 with radiation was also examined in the A549 
35 tumor-nude mouse model (see Example 4). A549 cells were injected into the hind flanks of 
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nude mice, and tumors were allowed to form. When tumors reached approximately 50-ui, 
they were injected with buffer or with 5 x 10 8 PFU of KD3 or GZ3. Eight to ten tumors were 
injected per test condition. At 1 day p.i., half the mice received 600 cGy of whole body 
radiation. Tumor size was measured over time, and was plotted as a fold-increase in tumor 
5 size versus days p.i. (as described in Example 4). As shown in Fig. 17, the non-radiated 
buffer-injected tumors grew faster than those injected with KD3 or GZ3. Tumors that 
received the combination of KD3 and radiation did not grow, and those that received the 
combination of GZ3 and radiation shrank in size after 14 days. These results indicate that the 
combination of KD3 plus radiation or GZ3 plus radiation is more effective than either vector 
1 0 alone or radiation alone in reducing the rate of A549 tumor growth in nude mice. It is likely 
that radiation would increase the effectiveness in treating tumors of KD1 and GZ1, or indeed 
any other replication-competent or replication-defective Ad vector. 

The mechanism by which radiation causes the ADP overexpressing vectors to lyse 
cells and spread from cell-to-cell more effectively is not understood. Radiation is expected to 
1 5 induce cellular DNA repair mechanisms, and that may allow for more efficient synthesis of 
AdDNA. Radiation may enhance the function of ADP. ADP probably functions by 
interacting with one or more cellular proteins, and radiation may affect this protein(s) such 
that ADP functions more efficiently. 

It is believed that KD1, KD3, GZ1, or GZ3, or any other replication-competent Ad 
20 vector, when used in combination with radiation, will be more effective than vector alone or 
radiation alone in providing clinical benefit to patients with cancer. The vectors should allow 
more tumor destruction with a given amount of radiation. Stated another way, radiation 
should cause more tumor destruction with a given amount of vector. These vectors should 
also allow the radiation oncologist to use less radiation to achieve the same amount of tumor 
25 destruction. Less radiation would reduce the side effects of the radiation. 

It is also believed that a cocktail of vectors when used in combination with radiation 
will be more effective than the cocktail alone or radiation alone. The cocktail could consist of 
ADP producing vectors plus one or more replication defective vectors expressing an 
anticancer therapeutic protein (see Example 5). 
30 Example 9 

This example illustrates a structure-function analysis of adenovirus death protein. 
ADP is an 1 1.6 kDa N-linked O- linked integral membrane glycoprotein that localizes 
to the inner nuclear membrane (NM) (Scaria et al., Virology 191:743-753). As illustrated in 
Fig. 1 8, the Ad2-encoded ADP (SEQ ID NO:6) consists of 101 amino acids; aa 1-40 (SEQ ID 
35 NO:17) are lumenal, aa 41-59 (SEQ ID NO:18) constitute the transmembrane signal-anchor 
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(SA) domain, aa 63-70 (SEQ ID NO: 19) constitute a basic proline (BP) domain within the 
nucleoplasms (NP) domain, which constitutes aa 61-101 (SEQ ID NO:20). To determine 
which domains in ADP are required to promote cell death, a number of deletion mutants of 
reclOO were prepared which lacked various portions of the ADP gene and examined for the 
5 ability of ADP to localize to the NM and promote death. The rec700 virus is an Ad5-Ad-Ad5 
recombinant, which has been described elsewhere (Wold et al., Virology 7^:168-180, 1986). 

The structure of ADP in reclOO and in each deletion mutant is schematically 
illustrated in Fig. 18. The ADP gene in each deletion mutant has been sequenced using PCR 
methods to insure that the mutations are correct. The structure and activity of ADP in the 

1 0 deletion mutants was tested by infecting A549 cells followed by immunoblot analysis of the 
ADP mutant proteins as well as the ability to lyse cells. All deletion mutants expressed a 
stable ADP protein except />m734.1 (Al-48, i.e. aa 1-48 are deleted). The /wz734.7 (N M ) 
ADP, which has Asn^ mutated to Ser, is O-glycosylated but not N-glycosylated because 
Asn M is the only N-glycosylation site (data not shown). The J/735 (A4-1 1) ADP is N- 

1 5 glycosylated but not O-glycosylated because the sites for O-glycosylation are deleted (data 
not shown). The pm!3AA (M56) ADP, which has Met 56 in the SA domain mutated to Ser, 
contains exclusively N-linked high-marmose oligosaccharides (data not shown); this occurs 
because the Met 56 mutation precludes exit of ADP from the endoplasmic reticulum (ER). The 
d!738 ADP, which lacks aa 46-60 in the signal-anchor domain, forms insoluble aggregates in 

20 the cytoplasm; therefore, aa 41-59 do in fact include the signal-anchor domain. The p/»734 
(A 1-40) ADP, which initiates at MeUi at the N-terminus of the SA domain, comigrated with 
the lower group of bands generated by proteolytic processing (data not shown). This 
indicates that the proteolytic cleavage sites occur near MeUi. Consistent with this, the 
proteolytic products were not seen with rf/737 (A29-45) (data not shown). Also, the size of 

25 the products decreased in all mutants with deletions within aa 41-101 (rf/715.1, J/715, </J714, 
dH 16) (data not shown). 

The.ability of these mutants to promote cell death was monitored by trypan blue 
exclusion, plaque development, and lactate dehydrogenase release assays (Tollefson et al., /. 
Virol 70:2296-2306, 1996). The trypan blue results in Fig. 15A indicate that the death- 

30 promoting function of ADP was abolished by deletion of aa 1-40 (pm734), aa 1 1-26 

(<ff736.1),aa 18-22 (<ff735.1), or aa 4-11 (dU3S). Mutation of the N-glycosylation site at 
Asn 14 (pm734.7) reduced the death-promoting activity to about 50% of rec700 (WT). dTIZl 
(A29-45) was efficient as rec700 in promoting cell death; this indicates that the proteolytic 
processing products must not be required to promote cell death because they are not formed 

35 with rf/737. The SA domain is essential for death because dH3Z (A46-60) and pmlZAA 
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(M56) were completely defective (Fig. 19). dfl 15.1 was nearly completely defective, 
indicating that the BP domain is extremely important Surprisingly, aa 71-94 (<//714), 76-89 
(rf/715), and 79-101 (<#716) could be deleted without affecting the death-promoting activity 
of ADP (Fig. 19). On the other hand, deletion of aa 81-88 (<#717) nearly completely 
5 abolished the activity of ADP (Fig. 1 9); this is probably the result of aberrant sorting of ADP 
(see below). Similar results were obtained when the ability of these ADP mutants to promote 
cell death was examined with standard plaque development, LDH-release and MTT assays. 

The effects of these mutations on the intracellular localization of ADP are extremely 
interesting. When examined by immunofluorescence (IF) at 33 h p.i. (data not shown), ADP 
1 0 from rec700 (WT) localized crisply to the NM; localization to the Golgi was also apparent. 
With rf/714 (A71-94) and rf/715 (A76-89), ADP localized to all membranes, i.e. the ER, Golgi, 
plasma membrane, and NM; This was even more apparent at 45 h p.i. (data not shown) 
Thus, aa 71-94 appear to include a signal that directs ADP specifically to the NM. ADP is 
very likely sorted from the /rarar-Golgi network (TGN) to the NM, so this putative signal in 
15 ADP probably functions in this sorting pathway. ADP from rf/717 (A81-88) is intriguing: it 
localized to the NM and Golgi, but in many cells "dots" and circular structures were observed. 
Again, this was more apparent at 45 h p.i. when these structures were the prominent feature. 
J/717-infected cells have not begun to die at 45 h p.i., so these structures are not cellular 
remnants. The intriguing possibility is that these structures are membrane vesicles that have 
20 pinched off from the TGN but are defective in targeting to and/or fusing with the NM. 

With <//738 (A46-60 in the SA domain), ADP aggregated in the cytoplasm. This 
again indicates that aa 46-60 include the SA sequence. With />m734.4 (M56), ADP localized 
primarily to the NM. As discussed above, the pmlZAA ADP has exclusively high-mannose 
N-linked oligosaccharides, indicating that it never leaves the ER. Perhaps the putative NM- 
25 localization signal in the C-terminal region of the /wn734.4 ADP targets ADP to the NM by 
lateral diffusion from the ER (which is continuous with the outer and inner NM). 

With <ff737 (A29-45), ADP localized to the NM. ADP from pmlTA (Al-40),pm734.7 
(N14) (N-linked glycosylate cannot occur), and d/735 (A4-1 1; the O-glycosylation sites are 
deleted) localized much more prominently to the Golgi than the NM. ADP from d/735. 1 
30 (A18-22) and d/736.1 (Al 1-26) also localized much more strongly to the Golgi than the NM. 
Thus, residues 1-26 and/or glycosylation appear to be required for efficient transport of ADP 
from the Golgi/TGN to the NM. 

In summary, aa 41-59 include the SA domain, Met* in the SA domain is required for 
exit from the ER, aa 1-26 are required for efficient exit from the Golgi, and aa 76-94 are 
35 required to target ADP specifically to the NM. With respect to promoting cell death, the 
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essential regions are aa 1 -26, the SA domain (ADP does not enter membranes), Met* in the 
SA domain, and the BP domain (aa 63-70). It is not clear whether the defective death- 
promoting phenotype of/wi734 (Al-40), (A4-1 1), <tf735.1 (A18-22), d/736.1 (Al 1-26), 
and pm734.7 (N14) is due to lack of sequences (or oligosaccharides) that promote death or to 
5 much slower exit of ADP from the Golgi to the NM. rf/714 (A71-94) and d!7\5 (A76-89) 
express a wild-type phenotype for promoting death even though they are defective in 
localizing specifically to the NM; this is probably because sufficient ADP still enters the NM 
to promote death. Even though the deletion in dll 1 7 (A8 1-88) lies within the deletions in 
J/715 (A76-89) and d/714 (A71-94), the dHll ADP is only about 15% as efficient as reclQO 
10 (WT), dHlS and d/714 in promoting death. This may be because the dTIM ADP tends to 

remain in vesicles rather than localizing to the NM. Altogether, these data indicate that ADP 
must localize to the NM in order to promote cell death. 

Example 10 

This example further characterizes the tissue specific Ad vectors described in Example 6. As 

15 discussed therein, the Ad E4 promoter is deleted and replaced with the promoter for surfactant 
protein B (SPB) in these vectors (Figure 24). 
Materials and Methods 

Cells, vectors and methods described in Example 6 were also used in this Example. 
In addition to the human cancer cell lines A549 (human lung carcinoma), Hep 3B (human 

20 hepatocellular carcinoma), and H441 (papillary lung adenocarcinoma) used in Example 6, 
HEK 293 cells (obtained from Microbix (Toronto, ON)) and VK10-9 cells were used. VklO- 
9 cells are 293 cells that in addition to El contain and express E4 and pDC. These cells will 
be referred to as 293-E4 cells. 

Experiments employing phase contrast microscopy of Hep 3B and H441 cells were 

25 performed as follows. Monolayers of Hep 3B or H441 cells were grown in 60 mm dishes 

with 5 ml of DMEM (10% FBS), and were mock-infected or infected with KD1 or KD1-SPB 
at a multiplicity of infection of 10 plaque forming units (PFU) per cell. Phase contrast 
photographs of monolayers were taken at 4 and 7 days postinfection (p.i.). 

Experiments employing western blots of H441 or Hep 3B cells were performed as 

30 follows. H441 or Hep 3B cells (in 60 mm dishes) were infected with 10 PFU/cell of KD1 or 
KD1-SPB. At 24 h p.i., the cells were washed three times with PBS and harvested by 
scraping. The cells were lysed by RIPA buffer. The protein concentration was measured by 
the BIO-RAD DC Protein Assay Kit (BIORAD Laboratories, Hercules, CA) and 10 pgof 
each sample were electrophoresed on 15% sodium dodecy [sulfate polyacrylamide gels (SDS- 

35 PAGE). The gels were electroblotted onto PVDF membranes (Immobilon, Millipore, 
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Bedford, MA). The membranes were blocked in TBST (50 mM Tris-Cl, pH 7.6, 150 mM 
NaCl, 02% Tween 20) containing 10% dry milk (Carnation) overnight at 4°C After 
blocking, the membranes were incubated with a rabbit polyclonal antiserum against E40RF3 
(gift of Gary Ketner) or ADP (Tollefson et al., J. Virol. 65:3633-3642, 1992), or with M73, a 
5 monoclonal antibody against El A (Harlow et al., J. Virol 55:533-546, 1985). The secondary 
antibodies were goat anti-rabbit IgG-HRP or goat anti-mouse IgG-HRP. The blots were 
developed using the ECL protocol (Amersham Pharmacia, Arlington Heights, IL). 

Experiments employing a lactate dehydrogenase release assay for cell lysis (Tollefson 
et al., J. Virol 70:2296-2306) were preformed as follows. H441 cells (7.7 x 10 5 cells per 35 

10 mm dish) and Hep 3B cells (9.0 x 10 s cells per 35 mm dish) were infected at 20 PFU/cell in 
one ml serum-free DMEM. After an adsorption period of 1 h, 3 ml of DMEM (10% FBS) 
were added (final FBS concentration of 7.5%). Cells were incubated at 37°C with 6% CO2. 
At daily intervals, supernatants were collected, microfuged to remove floating cells, and cell- 
free supematants were frozen at -70°C until assayed. Total lysis samples were prepared by 

1 5 addition of 1 OX lysis buffer included in the Cyto Tox 96 kit (Promega, Madison, WI). After 
all samples were collected, 20 pJ samples were assayed in triplicate using the LDH assay kit 
Cyto Tox 96 and read on an EL340 Microplate reader (BioTecTM Instruments, Inc.) at 490 
nm. 

Experiments employing immunofluorescence evaluation of H441 and Hep 3B cells 

20 were performed as follows. H441 and Hep 3B cells were plated on Coming #1 coverslips in 
35 mm dishes. H441 (1.5 x 10 6 cells/35 mm dish) and Hep 3B (9.0 x 10 s cells/35 mm dish) 
were infected with 20 PFU/cell of the indicated viruses in 1 ml serum-free DMEM. After 1 h, 
1 ml of DMEM/20% FBS was added (final concentration of 10% FBS). At the indicated 
times (48 h or 6 d p.i.), cells were fixed for 10 min in 3.7% paraformaldehyde in PBS, then 

25 permeabilized for 6 min in methanol (-20°C) and rehydrated in PBS. Coverslips were stained 
with rabbit antipeptide antiserum against the Ad E2A-coded DNA binding protein (DBP) 
(1:400 dilution; gift of Maurice Green) and mouse monoclonal antibody against fiber (1:400 
dilution; gift of Jeff Engler) or were stained with rabbit antiserum to E40RF3 (1 :250 dilution; 
gift of Gary Ketner). Secondary antibodies (Cappel/ICN) were used at 1:50 dilution. All 

30 antibodies were diluted in PBS containingl% BSA and 0. 1% sodium azide. Photographs 

were taken on a Nikon epifluorescence microscope using a 100X Planapo lens and Tmax 400 
film (Kodak). The film was developed in Diafine developer. 

Analysis of viral DNA replication by Southern hybridization was performed as 
follows. H441 and Hep 3B cells were grown in 60 mm dishes in DMEM supplemented with 

35 10% FBS. Cells were infected at 70% confluence with 10 PFU/cell of KD1 or KD1-SPB. 
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Dishes were incubated in humidified 5% CO2 atmosphere at 37°C. Total genomic DNAs 
were isolated at 5, 24, 48, 72, and 96 h p.i. Equal amounts of total genomic DNAs were 
digested with Hindm and resolved on a 1% agarose gel prior to transfer onto membranes. A 
random primer 32 P-labeled pBHGlO plasmid probe (Bett et aL, Proa Natl. Acad ScL USA 
5 91 :8802-8806, 1 994) was used for hybridization, and the blots were autoradiographed. DNA 
fragments were quanutated on a Molecular Dynamics Phosphorlmager. 

Virus yields were determined as follows. Hep 3B cells or H441 cells grown as 
monolayers in 35 mm dishes were infected with 10 PFU/cell of KD1 or KD1-SPB. At days 0 
to 4 (for H441) or days 0 to 9 (for Hep 3B) p.i., cells and culture medium were frozen at - 

1 0 70°C. Samples were frozen and thawed three times to release the virus from the cells, and 
total virus yields were determined by plaque assay on A549 monolayers. 

The effect of KD1-SPB and KD1 on H441 and Hep 3B tumors was examined in a 
nude mouse model (Doronin et aL, /. Virol 74:6147-6155, 2000). Tumor cells (10 7 cells in 
200 uJ of DMEM, 50% Matrigel [Becton Dickinson Labware, Bedford, MA] for H441 cells, 

15 or 10 7 cells in 200 uJ of DMEM plus 10% Matrigel for Hep 3B cells) were injected into flanks 
of 5-6 weeks old athymic nude mice and allowed to grow for three weeks to about 100 ul 
(H441) or 150 ul (Hep 3B) volumes. Pre-established tumors (n = 10) were injected with 50 ul 
of DMEM or 5 x 10 7 PFU of indicated viruses in DMEM. Injections of the viruses were 
repeated twice weekly for 3 weeks to the total dose of 3.0 x 10 8 PFU per tumor. Tumor size 

20 measurements were taken twice per week for H441 cells, or weekly for Hep 3B cells using a 
Sylvac digital caliper. Tumor volumes were calculated in according to the formula: length x 
width 2 /2. Data are represented as means of increase in tumor size relative to the tumor size at 
the initial injection. 
Results 

25 The properties of KD1-SPB in various cell types were compared to those of its 

"parent", KD1. Figure 25 shows the plaque development properties of these vectors on 293- 
E4, 293, and A549 cells. The data are plotted as the number of plaques seen on any day of 
the plaque assay as a percentage of the number of plaques seen at the end of the assay (i.e. 
when new plaques cease to appear) (Tollefson et al., J. Virol 70:2296-2306, 1966). This 

30 assay is an indicator of the size of the plaques. KD1 formed plaques equally well on 293-E4 
and 293 cells (Figure 25 A). With KD1-SPB, plaques were observed about 3-4 days sooner on 
293-E4 compared to 293 cells (Fig. 2A). On A549 cells, KD1 formed plaques 4-6 days 
sooner than KD1-SPB (Figure 25B). 

The properties of KD1-SPB versus KD1 were characterized in detail in H441 cells, a 

35 human papillary lung adenocarcinoma cell line known to express the TTF1 transcription 
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factor and in which the SPB promoter is active (Yan et al. a J. Biol. Chem. 270:24852-24857, 
1995). Hep 3B cells, a human hepatocellular carcinoma in which the SPB promoter should 
not be active, were used as a negative control. H441 and Hep 3B monolayers were infected 
with 10 PFU/cell of KD1 or KD1-SPB and photographed at 4 and 7 days p.i. Mock-infected 
5 Hep 3B cells formed a relatively homogeneous monolayer, but H441 cells tended to form 
structures that resemble syncytia (Figure 26 A, B). As expected, KD1 produced cytopathic 
effect (CPE) on both cell lines at 4 and 7 days p.i. (Figure 26A, B). Also as expected, KD1- 
SPB caused CPE on H441 cells but not on Hep 3B cells. Since CPE in Ad-infected cells is 
usually an indicator of virus growth, these results suggest that KD1-SPB grows in H441 but 
10 not in Hep 3B cells. 

To examine viral DNA replication, H441 and Hep 3B cells were infected with 10 
PFU/cell of KD1 or KD1-SPB, then the accumulation of viral DNA was determined by DNA 
blot With H441 cells, KD1 and KD1-SPB DNAs were readily detected at similar levels at 
48-96 h p.i. (Figure 27A). With Hep 3B cells, KD1 DNA levels were similar to those in 
15 H441 cells, but KD1-SPB DNA was barely detectable. This was confirmed by 
Phosphorlmager analysis of the DNA bands (Figure 27B). 

Growth of KD1-SPB and KD1 in H441 and Hep 3B cells was determined by a single 
step growth assay. Cells were infected with 10 PFU/cell of vector, then total vector yield was 
determined by plaque assay. Total yield of both vectors was similar in H441 cells, reaching a 
plateau after 2 days (Fig. 28A). KD1 yield plateaued in Hep 3B cells after 2-4 days p.i. 
(Figure 28B). However, KD1-SPB levels were about 5 logs lower in Hep 3B cells after 2-4 
days, and even by 9 days they had not achieved the levels of KD1. We conclude that KD1- 
SPB grows with significant specificity on H441 versus Hep 3B cells. Further, KD1-SPB 
grows as well as KD1 on H441 cells, indicating that the E4 promoter deletion by itself does 
not significantly compromise the vector, and that the E4 promoter can be replaced by a tissue- 
specific promoter in a replication-competent vector. 

To obtain further details on the replication of KD1-SPB vs KD1 in H441 and Hep 3B 
cells, the expression of representative Ad proteins by KD1-SPB and KD1 was examined. 
H441 or Hep 3B cells were mock-infected or infected with 10 PFU/ml ofKDl or KD1-SPB, 
then at 24 hp.i. the proteins were extracted and the El A, E40RF3, and ADP proteins were 
examined by immunoblot E40RF3 is one of the six proteins coded by the E4 transcription 
unit (Leppard,y. Gen. Virol 75:2131-2138, 1997). As anticipated, KD1-SPB expressed 
E40RF3 well in H441 cells, but only at trace levels in Hep 3B cells (Figure 29). KD1-SPB 
expressed the El A proteins in Hep 3B cells. Synthesis of El A proteins by KD1-SPB in Hep 
3B cells is expected because El A expression does not require E4 proteins; it also indicates 
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that the block to infection with KD1-SPB is downstream of El A. KD1 expressed El A in 
both cell lines, but the amount was less than obtained with KD1-SPB in Hep 3B cells (Figure 
29). The increased El A levels seen with KDI-SPB may reflect its poor ability to enter the 
late phase of infection (see Discussion). KD1-SPB expressed ADP as well as KD1 in H441 
5 cells, but it did not make detectable ADP in Hep 3B cells. ADP is primarily a late protein, so 
this result is consistent with the relative lack of E4 protein expression, DNA replication, and 
growth of KDi-SPB in Hep 3B cells. 

To gain insights into replication events that occur in individual cells, expression of 
E40RF3, the E2A-DBP, and the fiber late protein was examined by immunofluorescence. 

10 H441 or Hep 3B cells were infected with 20 PFU/cell. At 48 h or 6 days p.i., cells were fixed 
and immunostained. E40RF3 was detected in the nuclei of H441 cells at 48 h p.i. with KD1, 
KDI-SPB, or dl309 (Figure 30A). (dl309 is an Ad5 mutant that has wild-type El A, expresses 
Ad5 levels of ADP, and lacks the E3-RID and E3-14.7K genes). E40RF3 could not be 
detected in the vast majority of Hep 3B cells infected with KD1-SPB (Figure 30A), even at 6 

15 days p.i. (Figure 30B). Thus, KD1-SPB expresses E40RF3 well in H441 but not in Hep 3B 
cells. 

Figure 3 1 A shows double label immunofluorescence of DBP and fiber in the same 
Hep 3B cells at 48 h p.i. with KD1 or KDI-SPB. With KD1, there was a strong speckled 
staining pattern in the nucleus that is typical for DBP at 48 h p.i. (Figure 3 1 A, top left panel). 

20 There was strong staining of fiber throughout these same cells (Figure 3 1 A, top right panel). 
Staining of the cytoplasm and nucleus is expected because fiber is synthesized in the 
cytoplasm and then transported to the nucleus where virions assemble. With KD1-SPB at 48 
h p.i., about 25% of the cells showed the speckled staining for DBP, and only one cell (7% of 
total) with the advanced speckled pattern was also stained for fiber (Figure 3 1 A, bottom two 

25 panels). Even at 6 days p.i., only about 30% of cells showed staining for DBP, and about 
20% for fiber (Figure 3 IB). Thus, markedly fewer Hep 3B cells infected with KD1-SPB 
expressed DBP and especially fiber as compared to KD1. These results indicate that KD1- 
SPB replicates as well as KD1 in H441 cells, no doubt because the SPB promoter is active in 
H441 cells (Yan et al., /. Biol. Chem. 270:24852-24857, 1995). KD1-SPB barely replicates 

30 in Hep 3B cells, presumably because the SPB promoter is minimally active in these cells. 

At the culmination of replication, Ad-infected cells are lysed and the virus spreads to 
other cells; this process is mediated in large part by ADP (Tollefson et al., Virology 220:152- 
162, 1996; Tollefson etaL, J. Virol 70:2296-2306, 1996). To examine vector-induced cell 
lysis, H441 and Hep 3B cells were mock-infected or infected with 20 PFU/cell of KD1, KD1- 

3 5 SPB, or dl309, and cell lysis was determined by release of lactate dehydrogenase (Tollefson et 
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ah, J. Virol 70:2296-2306, 1996). All vectors lysed H441 cells beginning at 2-3 days p.i. 
(Figure 32A). KD1 and dl309 also lysed Hep 3B cells in the same time period; however, 
KD1-SPB caused only minimal cell lysis (Figure 9B). Thus, these data, along with the cell 
spread data in Example 6 and Figure 13, demonstrate that KD1-SPB lyses cells and spreads 
5 efficiently from cell-to-cell in H44 1 but not Hep 3B cells. 

An experiment was conducted to determine whether KD1-SPB or KD1 would 
suppress H441 tumors in nude mice. H441 cells were injected into each hind flank. When 
tumors had grown to about 100 yd (H441) or 150 pJ (Hep 3B), they were injected twice 
weekly for 3 weeks with DMEM (mock) or 5 x 10 7 PFU of test vims in 50 pi of DMEM (3.0 

10 x 10 8 total PFU). Ten tumors (5 mice) were used for each virus. Growth of H441 tumors was 
suppressed similarly by KD1-SPB and KD1 (Figure 33A). KD1 suppressed growth of Hep 
3B tumors, whereas KD1-SPB caused only minimal suppression (Figure 33B). These results 
show that KD1-SPB is as effective as KD1 in suppressing tumors when the SPB promoter is 
active. Further, the cell type specificity observed with KD1-SPB in vitro is maintained in 

15 vivo. 

Discussion 

Tumor specificity is one of the biggest challenges facing cancer gene therapy, i.e. 
having the therapeutic gene be expressed specifically in cancer cells. Specificity is very 
important for RC viruses. Two main strategies have been described that in theory confer 

20 specificity: transductional targeting and transcriptional targeting. Directing specificity of 
vectors toward specific cell surface receptors on the target cells has been attempted through 
various methods. Although this approach is theoretically attractive it might encounter 
multiple obstacles such as the lack of incorporation of the engineered protein into the virion 
(Scaria et ah, Virology 797:743-753, 1992) or lack of infectivity through the targeted receptor 

25 (Cosset et aL, 7. Virol 69:63 14-6322, 1995). Transcriptional targeting utilizes tumor and 

tissue specific promoters. In replication-defective vectors these regulatory sequences confine 
the expression of cytotoxic genes to specific tissues. In replication-competent vectors, as an 
added layer of regulation, vector replication per se can be placed under the control of tumor or 
tissue specific promoter/enhancer sequences. In replication-competent Ad, insertion of the 

30 tissue or tumor specific promoter/enhancer into the El A promoter/enhancer region has been 
used exclusively (Hallenbeck et aL, Hum. Gene Ther. 70:1721-1733, 1999; Rodriguez et aL, 
Cancer Res, 57:2559-2563, 1997; Yu et aL, Cancer Res. 59, 4200-4203, 1999; Yu et aL, 
Cancer Res. 59:1498-1504, 1999). The rationale behind these vectors is that expression of 
El A and therefore the whole Ad transcription program will depend on these tissue or tumor 

35 specific promoters. However, as a generic approach, there may be difficulties. The El A 
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enhancer/promoter is very complex. The enhancer controls not only the El A promoter but 
also distant promoters such as the E4 promoter (Shenk, T. pp. 21 1 1-2148 In B .N. Fields, 
D.M. Knipe, and P.M. Howley (eds.), Fields Virology, Lippincott-Raven, Philadelphia, 
1996). In addition, it has been shown that the El A enhancer in the inverted terminal repeat 
5 region changes tissue specificity of cellular promoters (Shi et aL, Hum. Gene Ther. 5:403- 
410, 1997). Also, the El A enhancer/promoter is partially embedded within the signals 
required to package the Ad genome into virions, and it may be problematic to remove all the 
El A enhancer elements without impairing virus production. Accordingly, we chose to 
replace the E4 promoter with a tissue specific promoter. E4 genes are essential for Ad 
1 0 replication, and therefore we expected that the replication of the recombinant virus would be 
dependent on the tissue specific regulatory elements. 

To construct KD1-SPB, the ca. 300 bp of the E4 promoter was deleted and the B-500 
version (ca. 500 bp) of SPB promoter was inserted (Yan et aL, supra) (Figure 24 C, D). We 
selected the SPB promoter because of its strict tissue specificity: it is exclusively active in 
1 5 type II alveolar cells and bronchial epithelial cells of the lung (Bohinski et al., 1 994, Mol 
Cell Biol 74:5671-5681, 1994). Since the parental virus KD1 contains and expresses two 
El A mutations that restrict virus replication to tumor cells (Doronin et al., supra), we 
anticipated that the virus would selectively replicate in cells derived from lung tumors. Thus, 
H44 1 cells, a papillary lung carcinoma cell line, were used to characterize the replication, 
20 gene expression, and functional profile of KD 1 -SPB. 

KD1-SPB formed plaques 3-4 days sooner on 293-E4 cells that express E4 proteins 
than on 293 cells, whereas KD1 formed plaques with the same kinetics on both cell lines. 
These data show that the E4 promoter is active in 293 cells, and that the SPB promoter 
displays very low activity in 293 cells. It is not clear why KD1-SPB forms plaques on 293 
25 cells; these cells are derived from human embryonic kidney and at least one of the 

transcription factors regulating the SPB promoter (Bohinski et al., supra), hepatocyte nuclear 
factor 3, is expressed in embryonic kidney. It is also possible that TTF1, the master 
regulatory factor of SPB expression, is minimally active in 293 cells. 

KD1 grew to equally high titers in H441 and Hep 3B cells (Figure 28A, B). In 
30 contrast, KD1-SPB replicated as efficiently as KD1 in H441 cells, in which the SPB promoter 
is active (Yan et al., supra) (Figure 28A), but replicated poorly in Hep 3B cells, most likely 
because the SPB promoter is inactive (Figure 28B). This selectivity has been confirmed by 
measuring viral DNA production in the two cell lines. KD1-SPB DNA replication was 
similar both kinetically and quantitatively to KD1 DNA replication in H441, however in Hep 
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3B cells, KD1-SPB DNA was almost undetectable (Figure 27 A, B). The cytopathic effect, a 
surrogate marker of Ad replication, showed a similar specificity (Figure 26). 

To further confirm our predictions on the molecular basis of the observed issue 
specificity we monitored viral protein expression. When cells were infected with KD1-SPB 
5 all the viral proteins early or late, except for El A, were expressed in a tissue-specific fashion 
(high expression in H441, low to undetectable expression in Hep 3B) (Figures 29-31). We 
found a good correlation between the levels of E4 promoter activity (E40RF3 expression) 
and the expression of E2A-DBP, ADP, and fiber proteins. Thus, the SPB promoter retains its 
tissue specificity in the Ad genome and it seems to be the limiting factor of Ad gene 

1 0 expression in the cell lines tested. As expected, expression of El A is not tissue-specific. 
Thus, the regulatory step of tissue-specific Ad DNA replication is downstream of E1A. In 
Hep 3B cells, KD1-SPB expressed El A at a higher level than did KD1 (Figure 29), strongly 
suggesting that KD1-SPB replication in most of the Hep3B cells remains at the early stage. 
The cytolytic effect of KD1-SPB also showed a tissue-specific profile (Figure 32; 

15 Figure 13 of Example 6), i.e., preferential lysis of H441 cells over Hep 3B cells, a pattern 
similar to the specificity observed at the level of DNA replication (Figure 27) and viral 
protein synthesis (Figures 29-31). This cell type specificity was also observed when these 
cells were growing as tumors in nude mice. Growth of H441 tumors was suppressed by KD1- 
SPB and KD1 at similar efficacy (Figure 33A). In contrast, KD1-SPB unlike KD1 had only 

20 minimal effect on the growth of Hep 3B tumors (Figure 33B). 

In summary, substitution of the E4 promoter with a tissue specific promoter allows 
highly tissue specific replication of Ad vectors and in the target tissue it is as efficient as the 
replication of the parental virus. KD1-SPB lacks all E3 genes except ADP. E3 gp!9K, RID 
and 14.7K have been shown to protect Ad-infected cells from attack by cytotoxic 

25 lymphocytes and apoptosis-inducing cytokines such as tumor necrosis factor and Fas ligand 
(Wold et al., pp. 200-232 In A.J. Carm (ed.), DNA Virus Replication: Frontiers in Molecular 
Biology, Oxford University Press, Oxford, 2000; Wold et al., Curr. Opiru Immunol 77:380- 
386, 1999). 

The therapeutic index (virus produced in H441 cells compared to Hep 3B cells) of 
30 KD1-SPB is 10 4 - 10 5 for the first 4-5 days (Figure 28). These data compare to data reported 
by Calydon (10M0 5 ) for their prostate specific viruses (Rodriguez et al., supra; Yu et al., 
Cancer Res. 59, 4200-4203, 1999; Yu etal., Cancer Res. 59:1498-1504, 1999). We suggest 
that KD1-SPB has some added advantage over vectors reported by other laboratories because 
it encodes a mutant form of £1 A that restricts replication to cancer cells (Doronin et al., 
35 supra). 
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Although the lung ranks as die second highest cancer site for both men and women in 
the U.S. Reis et al., Cancer Res. 55:2398-2424, 2000), lung cancer has not been a major target 
for cancer vector gene therapy since intratumoral injection of virus is generally not feasible in 
the lungs. However, there has been a recent report of intra tumor injection of a replication- 
5 defective Ad vector into a lung tumor, and such an approach could be attempted with KD1- 
SPB. It may also be feasible to administer KD1-SPB systemically in the lung. 

In view of the above, it will be seen that the several advantages of the invention 
are achieved and other advantageous results attained. 

As various changes could be made in the above methods and compositions 
1 0 without departing from the scope of the invention, it is intended that all matter contained in 
the above description and shown in the accompanying drawings shall be interpreted as 
illustrative and not in a limiting sense. 

All references cited in this specification, including patents and patent 
applications, are hereby incorporated by reference. The discussion of references herein is 
1 5 intended merely to summarize the assertions made by their authors and no admission is made 
that any reference constitutes prior art Applicants reserve the right to challenge the accuracy 
and pertinence of the cited references. 
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What is Claimed Is : 

1 . A recombinant vector which is replication-competent in a neoplastic cell and 
which o verexpresses an adenovirus death protein. 

2. The recombinant vector of claim 1 wherein the adenovirus death protein 
comprises amino acids 1-26, 41-59, and 63-70 of SEQ ID NO:5, SEQ ID NO:6, SEQ ID 
NO: 7, or SEQ ID NO:8 or a conservatively substituted variant thereof or wherein the 
adenovirus death protein comprises SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, or SEQ ID 

5 NO:8. 

3 . The recombinant vector of claim 2 which comprises a recombinant virus. 

4. The recombinant vector of claim 3, wherein the recombinant virus is an 
adenovirus lacking expression of at least one E3 protein selected from the group consisting of: 
gpl9K; RIDa; RID0 and 14.7K. 

5 . The recombinant vector of claim 4 which comprises SEQ ID NO:3 or SEQ 

IDNO:4. 

6. The recombinant vector of claim 3 which is replication-restricted to 
neoplastic cells. 

7. The recombinant vector of claim 6 which comprises SEQ ID NO: 1 or SEQ 

IDNO:2. 

8. The recombinant vector of claim 3, wherein the recombinant adenovirus 
comprises a tissue specific promoter, a tumor specific promoter, or an inducible promoter 
substituted for the E4 promoter. 

9. The recombinant vector of claim 8, wherein the tissue-specific promoter is a 
surfactant protein B promoter. 

10. The recombinant vector of claim 6 which comprises SEQ ID NO: 14, SEQ ID 
NO:15orSEQIDNO:16. 

1 1. The recombinant vector of claim 1, wherein the vector Anther comprises a 
gene encoding an anti-cancer product 

12. The recombinant vector of claim 11, wherein the gene encoding an anti- 
cancer product is in the E3 region of the vector. 

13. A method for promoting death of a neoplastic cell comprising contacting the 
neoplastic cell with at least one vector which is replication competent in the neoplastic cell 
and which overexpresses an adenovirus death protein. 

14. The method of claim 13 wherein the adenovirus death protein comprises 
amino acids 1-26, 41-59, and 63-70 of SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, or SEQ 
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ED NO:8 or a conservatively substituted variant thereof or wherein the adenovirus death 
protein comprises SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, or SEQ ID NO:8. 

15. The method of claim 14, wherein the vector comprises a recombinant 
adenovirus lacking expression of at least one E3 protein selected from the group consisting of: 
gp!9K; RIDa; RIDP and 14.7K. 

16. The method of claim 15, wherein the neoplastic cell comprises a tumor in a 
patient and the contacting step comprises administering the recombinant adenovirus to the 
tumor. 

1 7. The method of claim 1 6, further comprising the step of passively immunizing 
the patient against the recombinant adenovirus. 

1 8. The method of claim 17, wherein the recombinant adenovirus comprises SEQ 
IDNO:3orSEQIDNO:4. 

19. The method of claim 15, wherein the vector is replication-restricted to 
neoplastic cells. 

20. The method of claim 19, wherein the vector is a recombinant adenovirus 
comprising SEQ ID NO: 1 or SEQ ID NO:2. 

2 1 . The method of claim 1 5, wherein the recombinant adenovirus comprises a 
tissue specific promoter or an inducible promoter substituted for the E4 promoter. 

22. The method of claim 21 , wherein the tissue specific promoter is a surfactant 
protein B promoter. 

23. The method of claim 22, wherein the recombinant adenovirus comprises SEQ 
ID NO: 14, SEQ ID NO:15 or SEQ ID NO:16. 

24. The method of claim 16, further comprising treating the tumor with radiation. 

25. The method of claim 24, comprising administering more than one 
recombinant adenovirus to the tumor and treating the tumor with radiation. 

26. The method of claim 16, further comprising treating the tumor with 
chemotherapy. 

27. The method of claim 26, comprising administering more than one 
recombinant adenovirus to the tumor and treating the tumor with chemotherapy. 

28. The method of claim 16, further comprising administering to the tumor one 
or more replication-defective adenovirus which expresses an anti-cancer gene product, 
wherein the recombinant adenovirus complements spread of the replication-defective 
adenovirus in the tumor. 

29. A composition comprising: 
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a first recombinant virus which is replication competent in a neoplastic cell and 
overexpresses an adenovirus death protein; and 

a second recombinant vims which is replication defective and which expresses an 
5 anti-cancer gene product, 

wherein the first recombinant virus complements replication of the second recombinant virus. 

30. The composition of claim 29 wherein the first recombinant vims comprises a 
recombinant adenovirus lacking expression of at least one E3 protein selected from the group 
consisting of: gpl9K; RIDa; RID? and 14.7K. 

3 1 . The composition of claim 30 wherein the recombinant adenovirus comprises 
a nucleotide sequence selected from the group consisting of: SEQ ID NO:l; SEQ ID NO:2; 
SEQ ID NO: 14; SEQ ID NO: 15; SEQ ID NO: 16; SEQ ID NO:3; or SEQ ID NO:4. 

32. A composition comprising 

a first recombinant virus which is replication-defective in a neoplastic cell 
and which overexpresses an adenovirus death protein, and 

a second recombinant virus which is replication-competent in a neoplastic 

cell. 
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ad5 comple 35935 bp 
ad5 complete genome 
ads comple 



DNA 



SYN 



06-FEB-1999 



REFERENCE 



JOURNAL 
BASE COUNT 
ORIGIN 



ORGANISM 



AUTHORS 



Unknown. 

Unknown 

Unclassified . 

1 (bases X to 35935) 

Self 

Unpublished. 



8367 a 10073 C 



9761 g 



7734 t 



1 CATCATCAAT AATATACCTT ATTTTGGATT GAAGCCAATA TGATAATGAG GGGGTGGAGT 
61 TTGTGACGTG GCGCGGGGCG TGGGAACGGG GCGGGTGACG TAGTAGTGTG GCGG AAGTGT 
121 GATGTTGCAA GTGTGGCGGA ACACATGTAA GCGACGGATG TGGCAAAAGT GACGTTTTTG 
181 GTGTGCGCCG GTGTACACAG GAAGTGACAA TTTTCGCGCG GTTTTAGGCG GATGTTGTAG 
241 TAAATTTGGG CGTAACCGAG TAAGATTTGG CCATTTTCGC GGGAAAACTG AATAAGAGGA 
301 AGTGAAATCT GAATAATTTT GTGTTACTCA TAGC QCGTA A TATT TOTCTA GGGCCGCGGG 
361 GACTTTOACC GTTTACGTGG AGACTCGCCC AGGTGTTTTT CTCAGGTOTT TTCCGCGTTC 
421 CGGGTCAAAG TTGGCGTTTT ATTATTATAG TCAGCTGACG TGTAGTGTAT TTATACCCGG 
481 TGAGTTCCTC AAGAGGCCAC TCTTGAGTGC CAGCGAGTAG AGTTTTCTCC TCCGAGCOGC 
541 TCCGACACCG GGACTGAAAA TGAGACATAT TATCTGCCAC GGAGGTGTTA TTACCGAAGA 
601 AATGGCCGCC AGTCTTTTGG ACCAGCTGAT CGAAGAGGTA CTGGCTGATA ATCTTCCACC 
661 TCCTAGCCAT TTTGAACCAC CTACCCtTCA CGAACTGTAT GATTTAGACG T GAOG GCCCC 
721 CGAAGATCCC AACGAGGAGG CGGTTTCGCA GATTTTTCCC GACTCTGTAA TGTTGGCGGT 
781 GCAGGAAGGG ATTGACTTAC TCACTTTTCC GCCGGCGCCC GGTTCTCCGG AGCCGCCTCA 
841 CCTTTCCCGG CAGCCCGAGC AGCCGGAGCA GAGAGCCTTG GGTCCGGTTT CTATGCCAAA 
901 CCTTGTACCG GAGGTGATCG ATCTTACCTG CCACGAGGCT GGCTTTCCAC CCAGTGACQA 
961 CGAGGATGAA GAGGGTGAGG AGTTTGTGTT AGATTATGTG GAGCACCCCG GGCACGGTTG 
1021 CAGGTCTTGT CATTATCACC GGAGGAATAC GGGGGACCCA GATATTATGT GTTCGCTTTG 
1081 CTATATGAGG ACCTGTGGCA TGTTTGTCTA CAGTAAGTGA AAATTATGGG CAGTGGGTGA 
1141 TAGAGTGGTG GGTTTGGTGT GGTAATTTTT TTTTTAATTT TTACAGTTTT GTGGTTTAAA 
1201 GAATTTTGTA TTGTGATTTT TTTAAAAGGT CCTGTGTCTG AACCTGAGCC TGAGCCCGAG 
1261 CCAGAACOGG AGCCTGCAAG ACCTACCCGC CGTCCTAAAA TGGCGCCTGC TATCCTGAGA 
1321 CGCCCGACAT CACCTGTGTC TAGAGAATGC AATAGTAGTA CGGATAGCTG TGACTCCGGT 
1381 CCTTCTAACA CACCTCCTGA GATACACCCG GTGGTCCCGC TGTGCCCCAT TAAACCAGTT 
1441 GCCGTGAGAG TTGGTGGGCG TCGCCAGGCT GTGGAATGTA TCGAGGACTT GCTTAACGAG 
1501 CCTGGGCAAC CTTTGGACTT GAGCTGTAAA CGCCCCAGGC CATAAGGTGT AAACCTGTGA 
1561 TTGCGTGTGT GGTTAAOGCC TTTGT TTCCT GAATGAGTTG ATGTAAGTTT AATAAAGGGT 
1621 GAGATAATGT TTAACTTGCA TGGCGTGTTA AATGGGGCGG GGCT TAAAGQ GTATATAATG 
1681 CGCCGTGGGC TAATCTTGGT TACATCTGAC CTCATGGAGG CTTGGGAGTG TTTGGAAGAT 
1741 T TT TCT B CI Q TGCGTAACTT GCTGGAACAG AGCTCTAACA GTACCTCTTG GTTTTGQAGG 
1801 TTTCTGTGGG GCTCATCCCA GGCAAAGTTA GTCTGCAGAA TTAAGGAGGA TTACAAGTGG 
1861 GAATTTGAAG AGCTTTTGAA ATCCTGTGGT GAGCTGTTTG ATTCTTTGAA TCTGGGTCAC 
1921 CAGGCGCTTT TCCAAGAGAA GGTCATCAAG ACTTTGGATT TTTCCACACC GGGGCGCGCT 
1981 GCGGCTGCTG 1UI&T1TTT TT GAGTTTTATA AAGGATAAAT GGAGCGAAGA AACCCATCTG 
2041 AGCGGGGGGT ACCTGCTGGA TTTTCTGGCC ATGCATCTGT GGAGAGCGGT TGTGAGACAC 
2101 AAGAATCGCC TGCTACTGTT GTCTTCCGTC CGCCCGGCGA TAATACCGAC GGAGGAGCAG 
2161 CAGCAGCAGC AGGAGGAAGC CAGGCGGCGG CGGCAGGAGC AGAGCCCATG GAACCCGAGA 
2221 GCCGGCCTGG ACCCTCGGGA ATGAATGTTG TACAGGTGGC TGAACTGTAT CCAGAACTGA 
2281 GAOGCA TTTT GACAATTACA GAGGATGGGC AGGGGCTAAA GGGGGTAAAG AGGGAGCGGG 
2341 GGGCTTGTGA GGCTACAGAG GAGGCTAGGA ATCTAGCTTT TAGCTTAATG ACCAGACACC 
2401 GTCCTGAGTG TATTACTTTT CAACAGATCA AGGATAATTG CGCTAATGAG CTTGATCTGC 
2461 TGGCGCAGAA GTATTCCATA GAGCAGCTGA CCACTTACTQ GCTGCAGCCA GGGGATGATT 
2521 TTGAGGAGGC TATTAGGGTA TATGCAAAGG TGGCA CTTAG GCCAOATTGC AAQTACAAGA 
2581 TCAGCAAACT TGTAAATATC AGGAATTGTT GCTACATTTC TGGGAACGGG GCCGAGGTGG 
2641 AGATAGATAC GGAGGATA0G GTGGCCTTTA GATGTAGCAT GATAAATATG TGGCCGGGGG 
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2701 TGCTTGGCAT GGACGGGGTG GTTATTATGA ATGTAAGGTT TACTGGCCCC AATTT TAGCG 
2761 GTACGGTTTT CCTGGCCAAT ACCAACCTTA TCCTACACGG TGT AAGCT TC TATGG GTTTA 
2821 ACAATACCTG TGTGGAAGCC TGGACCGATG TAAOGGTTCG GGGCTGTGCC TTTTACTGCT 
2881 GCTGGAAGGG GGTGGTGTGT CGCCCCAAAA GCAGGGCTTC AATTAAGAAA TGCCTCTTTG 
2941 AAAGGTOTAC CTTGGGTATC CTGTCTGAGG GTAACTCCAG GGTGCGCCAC AATGTGGCCT 
3001 CCGACTGTGG TTGCTTCATG CTAGTGAAAA GCGTGGCTGT GATTAAGCAT AACATGGTAT 
3061 GTGGCAACTG CGAGGACAGG GCCTCTCAGA TGCTGACCTG CTCGGACGGC AACTG TCACC 
3121 TGCTCAAGAC CATTCACGTA GCCAGCCACT CTCGCAAGGC CTGGCCAGTG TTTGAGCATA 
3181 ACATACTGAC CCGCTGTTCC TTGCATTTGG GTAACAGGAG GGGGGTGTTC CTACCTTACC 
3241 AATGCAATTT GAGTCACACT AAGATATTGC TTGAGCCCGA GAGCATGTCC AAGGTGAACC 
3301 TGAACX3GGGT GTTTGACATG ACCATGAAGA TCTGGAAGGT GCTGAGGTAC GATGAGACCC 
3361 GCACCAGGTG CAGACCCTGC GAGTGTGGCG GTAAACATAT TAGGAACCAG CCTGTGATGC 
3421 TGGATGTGAC CGAGGAGCTG AGGCCCGATC ACTTGGTGCT GGCCTGCACC CGCGCTGAGT 
3481 TTGGCTCTAG CGATGAAGAT ACAGATTGAG GTACTGAAAT GTGTGGGCGT GGCTTAAGGG 
3541 TGGGAAAGAA TATATAAGGT GGGGGTCTTA TGTAGTTTTG TATCTGTTTT GCAGCAGCCG 
3601 CCGCCGCCAT GAGCACCAAC TCGTTTGATG GAAGCATTGT GAGCTCATAT TTQACAACGC 
3661 GCATGCCCCC ATGGGCCGGG GTGCGTCAGA ATGTGATGGG CTCCAGCATT GATGOTCGCC 
3721 CCGTCCTGCC CGCAAACTCT ACTACCTTGA CCTACGAGAC CGTGTCTGGA ACGCOGTTGG 
3781 AGACTGCAGC CTCCGCCGCC GCTTCAGCCG CTGCAGCCAC CGCCCGCGGG ATTGTGACTG 
3841 ACTTTGCTTT CCTGAGCCCG CTTGCAAGCA GTGCAGCTTC CCGTTCATCC GCCCGCQATG 
3901 ACAAGTTGAC GGCTCTTTTG GCACAATTGG ATTCTTTGAC CCGGGAACTT AATGTCGTTT 
3961 CTCAGCAGCT GTTGGATCTG CGCCAGCAGG TTTCTGCCCT GAAGGCTTCC TCCCCTCCCA 
4021 ATGCGGTTTA AAACATAAAT AAAAAACCAG ACTCTGTTTG GATTTGGATC AAGCAAGTGT 
4081 CTTGCTGTCT TTATTTAGGG-GTTTTGCGCG CGCGGTAGGC CCGGGACCAG CGGTCTCGGT 
4141 CGTTGAGGGT CCTGTGTATT TTTTCCAGGA CGTGGTAAAG GTGACTCTGG ATGTTCAGAT 
4201 ACATGGGCAT AAGCCCGTCT CTGGGGTGGA GGTAGCACCA CTGCAGAGCT TCATGCTGCG 
4261 GGGTGGTGTT GTAGATGATC CAGTCGTAGC AGGAGCGCTG GGCGTGGTGC CTAAAAATGT 
4321 CTTTCAGTAG CAAGCTGATT GCCAGGGGCA GGCCCTTGGT GTAAGTGTTT ACAAAGCGGT 
4381 TAAGCTGGGA TGGGTGCATA CGTGGGGATA TGAGATGCAT CTTGGACTGT ATTTTTAGGT 
4441 TGGCTATGTT CCCAGCCATA TCCCTCCGGG GATTCATGTT GTGCAGAACC ACCAGCACAG 
4501 TGTATCCGGT GCACTTGGGA AATTTGTCAT GTAGCTTAGA AGGAAATGCG TGGAAGAACT 
4561 TGGAGACGCC CTTGTGACCT CCAAGATTTT CCATGCATTC GTCCATAATG ATGGCAATGG 
4621 GCCCACGGGC GGCGGCCTGG GCGAAGATAT TTCTGGGATC ACTAACGTCA TAGTTGTGTT 
4681 CCAGGATGAG ATCGTCATAG GCCATTTTTA CAAAGCGCGG GCGGAGGGTG CCAGACTGCG 
4741 GTATAATGGT TCCATCCGGC CCAGGGGCGT AGTTACCCTC ACAGATTTGC ATTTCCCACG 
4801 CTTTGAGTTC AGATGGGGGG ATCATGTCTA CCTGCGGGGC GATGAAGAAA ACGGTTTCCG 
4861 GGGTAGGGGA GATCAGCTGG GAAGAAAGCA GGTTCCTGAG CAGCTGCGAC TTACCGCAGC 
4921 CGGTGGGCCC GTAAATCACA CCTATTACCG GGTGCAACTG GTAGTTAAGA GAG CTGCAGC 
4981 TGCCGTCATC CCTGAGCAGG GGGGCCACTT CGTTAAGCAT GTCCCTQACT CGCATGTTTT 
5041 CCCTCACCAA ATCCGCCAGA AGGCGCTCGC CGCCCAGCGA TAGCAGTTCT TGCAAGGAAG 
5101 CAAAGTTTTT CAACGGTTTG AGACCGTCCG CCGTAGGCAT GCTTTTGAGC GTTTGACCAA 
5161 GCAGTTCCAG GCGGTCCCAC AGCTCGGTCA CCTGCTCTAC GGCATCTCGA TCCAGCATAT 
5221 CTCCTCGTTT CGCGGGTTGG GGCGGCTTTC GCTGTACGGC AGTAGTCGGT QCTCGTCCAG 
5281 ACGGGCCAGG GTCATGTCTT TCCACGGGCG CAGGGTCCTC GTCAGCGTAG TCTGGGTCAC 
5341 GGTGAAGGGG TGCGCTCCGG GCTGCGCGCT GGCCAGGGTG CGCTTGAGGC TGGTCCTOCT 
5401 GGTGCTGAAG CGCTGCCGGT CTTCGCCCTG CGCGTCGGCC AGQTAGCATT TGACCATGGT 
5461 GTCATAGTCC AGCCCCTCCG CGGCGTGGCC CTTGGCGCGC AGCTTGCCCT TGGAGGAGGC 
5521 GCCGCACGAG GGGCAGTGCA GACTTTTGAG GGCGTAGAGC TTGGGCGCGA GAAATACCGA 
5581 TTCCGGGGAG TAGGCATCCG CGCCGCAGGC CCCGCAGACG GTCTCGCATT CX31CGAGCCA 
5641 GGTGAGCTCT GGCCGTTCGG GGTCAAAAAC CAGGTTTCCC CCATGCTTTT TGATOCOTTT 
5701 CTTACCTCTG GTTTCCATGA GCOGGTGTCC ACGCTCGGTG ACGAAAAGGC TOTCOGTGTC 
5761 CCCGTATACA GACTTGAGAG GCCTGTCCTC GAGCX3GTGTT CCGCGGTCCT CCTCOTATAG 
5821 AAACTCGGAC CACTCTOAGA CAAAGGCTCG CGTCCAGGCC AGCACGAAGG AGGCTAAGTG 
5881 GGAGGGGTAG CGQ TC QTTG T CCACTAGGGG GTCCACTCGC TCCAGGGTGT GAAGACACAT 
5941 GTCGCCCTCT TCGGCATCAA GGAAGGTGAT TGGTTTGTAG GTOTAGGCCA OGTGACCGGG 
6001 TGTTCCTGAA GGGGGGCTAT AAAAGGGGGT GGGGGCGCGT TOGTCCTCAC TCTCTTCOGC 
6061 ATCGCTGTCT GCGAGGGCCA GCTGTTGGGG TGAGTACTCC CTCTGAAAAG CGGGCATGAC 
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6121 TTCTGCGCTA AGATTGTCAG TTTCCAAAAA CGAGGAGGAT TTGATATTCA CCTGGCCCGC 
6181 GGTGATGCCT TTGAGGGTGG CCGCATCCAT CTGGTCAGAA AAGACAATCT TTTTGTTGTC 
6241 AAGCTTGGTG GCAAACGACC CGTAGAGGGC GTTGGACAOC AACTTGGCGA TGGAGCGCAG 
6301 GGTTTGGTTT TTGTCGCGAT CGGCGCGCTC CTTGGCCGCG ATGTTTAGCT GCACGTATTC 
6361 GCGCGCAACG CACCGCCATT CGGGAAAGAC GGTGGTQCGC TCGTCGGGCA CCAGGTGCAC 
6421 GCGCCAACCG CGGTTGTGCA GGGTGACAAG GTCAACGCTG GTGGCTACCT CTCCGCGTAG 
6481 GCGCTCGTTG GTCCAGCAGA GGCGGCCGCC CTTGCGCGAG CAGAATGGCG GTAGGGGGTC 
6541 TAGCTGCGTC TCGTCCGGGG GGTCTGCQTC CACGGTAAAG ACCCCGGGCA GCAGGCGCGC 
6601 GTCGAAGTAG TCTATCTTGC ATCCTTGCAA GTCTAGOGCC TGCTGCCATG CGCGGGCGGC 
6661 AAGCGCGOGC TCGTATGGGT TGAGTGGGGG ACCCCATGGC ATGGGGTGGG TGAGCGCGGA 
6721 GGCGTACATG CCGCAAATGT CGTAAACGTA GAGGGGCTCT CTGAGTATTC CAAGATATGT 
6781 AGGGTAGCAT CTTCCACCGC GGATGCTGGC GCGCACGTAA TCGTATAGTT CGTGCGAGGG 
6841 AGCGAGGAGG TCGGGACCGA GGTTGCTACG GGCGGGCTGC TCTGCTCGGA AGACTATCTG 
6901 CCTGAAGATG GCATGTGAGT TGGATGATAT GGTTGGACGC TGGAAGAOGT TGAAGCTGGC 
6961 GTCTGTGAGA CCTACCGCGT CACGCAOGAA GGAGGCGTAG GAGTCGCGCA GCTTGTTGAC 
7021 CAGCTCGGCG GTGACCTGCA CGTCTAGGGC GCAGTAGTCC AGGGTTTCCT TGATGATGTC 
7081 ATACTTATCC TGTCCCTTTT TTTTCCACAG CTCGCGGTTG AGOACAAACT CTTCGCGGTC 
7141 TTTCCAGTAC TCTTGGATCG GAAACCCGTC GGCCTCCGAA CGGTAAGAGC CTAGCATGTA 
7201 GAACTGGTTG ACGGCCTGGT AGGCGCAGCA TCCCTTTTCT ACGGGTAGCG CGT ATGCC TG 
7261 CGCGGCCTTC CGGAGCGAGG TGTGGGTGAG CGCAAAGGTG TCCCTGACCA TGACTTTGAG 
7321 GTACTGGTAT TTGAAGTCAG TGTCGTCGCA TCCGCCCTGC TCCCAGAGCA AAAAGTCCGT 
7381 GCGCTTTTTG GAACGCGGAT TTGGCAGGGC GAAGGTGACA TCGTTGAAGA GTATCTTTCC 
7441 CGCGCGAGGC ATAAAGTTGC GTGTGATGCG GAAGGGTCCC GGCACCTOGG AACGGTTGTT 
7501 AATTACCTGG GCGGCGAGCA CGATCTOGTC AAAGCCGTTG ATGTTG TGGC CCACAATOTA 
7561 AAGTTCCAAG AAGCGCGGGA TGCCCTTGAT GGAAGGCAAT TTTTTAAGTT CCTCGTAGGT 
7621 GAGCTCTTCA GGGGAGCTGA GCCCGTGCTC TGAAAGGGCC CAGTC TGCAA GATGAGGGTT 
7681 GGAAGCGACG AATGAGCTCC ACAGGTCACG GGCCATTAGC ATTTGCAGGT GGTCGCGAAA 
7741 GGTCCTAAAC TGGCGACCTA TGGCCATTTT TTCTGGGGTG ATGCAGTAGA AGGTAAGOGG 
7801 GTCTTGTTCC CAGCGGTCCC ATCCAAGGTT CGCGGCTAGG TCTCGCGCGG CAGTCACTAG 
7861 AGGCTCATCT CCGCOGAACT TCATGACCAG CATGAAGGGC ACGAGCTGCT TCCCAAAGGC 
7921 CCCCATCCAA GTATAGGTCT CTACATCGTA GGTGACAAAG AGACGCTCGG TGCQAGQATO 
7981 CGAGCCGATC GGGAAGAACT GGATCTCCCG CCACCAATTG GAGGAGTGGC TATTGATGTG 
8041 GTGAAAGTAG AAGTCCCTGC GACGGGCCGA ACACTCGTGC TGGCTTTTGT AAAAAOGTGC 
8101 GCAGTACTGG CAGCGGTGCA CGGGCTGTAC ATCCTGCACG AGGTTGACCT GACGACCGCG 
8161 CACAAGGAAG CAGAGTGGGA ATTTGAGCCC CTCGCCTGGC GGGTTTGGCT GGTGGTCTTC 
8221 TACTTCGGCT GCTT QTCC TT GACCGTCTGG CTGCTCGAGG GGAGTTACGG T GGAT CGGAC 
8281 CACCACGCCG CGCGAGCCCA AAGTCCAGAT GTCCGCGCGC GGCGGTCGGA GCTTGATGAC 
8341 AACATOGCGC AGATGGGAGC TGTCCATGGT CTGGAGCTCC CGCGGCGTCA GGTCAGGOGG 
8401 GAGCTCCTGC AGGTTTACCT CGCATAGACG GGTCAGGGOG CGGGCTAGAT CCAGGTGATA 
8461 CCTAATTTCC AGGGGCTGGT TCGTGGOGGC GTCGA1GGCT TGCAAGAGGC CGCATCCCCG 
8521 CGGCGCGACT AOGGTACCGC GCGGCGGGCG OTGGGCCGCG GGGGTGTCCT TGGATGATGC 
8581 ATCTAAAAGC GGTGACGCGG GCGAGCCCCC GGAGGTAGGG GGGGCTCCGG ACCCGCCGGG 
8641 AGAGGGGGCA GGGGCACOTC GGCGCCGCGC GCGGGCAGGA GCTGGTGCTG CGOGCGTAGG 
8701 TTGCTGGCQA ACGCGACGAC GCGGOGGTTG ATCTCCTOAA TCTGGCGCCT CTGCaTQAAG 
8761 ACGACGGGCC CGGTGAGCTT GAGCCTGAAA GAGAGTTOGA CAOAATCAAT TTCGGTGTOG 
8821 TTGACGGOGG CCTGGCGCAA AATCTCCTGC ACGTCTCCTG AGTTGTCTTG ATAGGCGATC 
8881 TCGGCCATGA ACTGCTCGAT CTCTTCCTCC TGGAGATCTC OGCGTCCGGC TCGCTCCACG 
8941 GTGGCGGCGA GGTCGTTGGA AATGCGGGCC ATGAGCTGCG AGAAGGCGTT GAGGCCTCCC 
9001 TCGTTCCAGA CGOGGCTGTA GAOCAOGCCC CCTTCGGCAT CGOGGGCGOG CATGACCACC 
9061 TGCGCGAGAT TGAOCTCCAC GTGCCGGGCG AAGACGGCGT AGTTTCGCAG GCGCTGAAAG 
9121 AGGTAGTTGA GGGTGGTGGC GOTOTGTTCT GCCAOGAAGA AGTACATAAC CCAGCGTCGC 
9181 AACGTGGATT CGTTGATATC CCCCAAGGCC TCAAGGCGCT CCATGGCCTC GTAOAAGTCC 
9241 AOGGCGAAGT TGAAAAACTG GGAGTTGCGC GCCGACAOGG TTAACTCCTC CTCCAQAAGA 
9301 CGGATGAGCT CGGCGACAGT GTCGCGCACC TCGCGCTCAA AGGCTACAGG GGOCTCTTCT 
9361 TCTTCTTCAA TCTCCTCTTC CATAAGGGCC TCCCCTTCTT CTTCTTCTGG CGGCGGTGGG 
9421 GGAGGGGGGA CAOGGCGGCG ACGACGGCGC ACOGGGAGGC GGT COACAAA GCGCTCGATC 
9481 ATCTCCCCGC GGCGACGGCG CATGGTCTCG GTGACGGOGC GGCOGTTCTC GCGGGGGOGC 
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9541 AGTTGGAAGA CGCCGCCCGT CATGTCCCGG TTATGGGTTG GCGGGGGGCT GCCATGCGGC 
9601 AGGGATACGG CGCTAACGAT GCATCTCAAC AATTGTTQTG TAGGTACTCC GCCGCCGAGG 
9661 GACCTGAGCG AGTCCGCATC GACCGGATCG GAAAACCTCT CGAGAAAGGC GTCTAACCAG 
9721 TCACAGTCGC AAGGTAGGCT GAGCACCGTG GCX3GGCGGCA GCGG GCGG CG GTCGGGGTTG 
9781 TTTCTGGCGG AGGTGCTGCT GATGATGTAA TTAAAGTAGG CGGTCTTGAG ACGGCGGATG 
9841 GTCGACAGAA GCACCATGTC CTTGGGTCCG GCCTGCTGAA TGCGCAGGCG GTCGGCCATG 
9901 CCCCAGGCTT CGTTTTGACA TCGGCGCAGG TCTTTGTAGT AGTCTTGCAT GAGCCTTTCT 
9961 ACCGGCACTT CTTCTTCTCC TTCCTCTTGT CCTGCATCTC TTGCATCTAT CGCTGCGGCG 
10021 GCGGCGGAGT TTGGCCGTAG GTGGCGCCCT CTTCCTCCCA TGCGTGTGAC CCCGAAGCCC 
10081 CTCATCGGCT GAAGCAGGGC TAGGTCGGCG ACAACGCGCT CGGCTAATAT GGCCTGCTGC 
10141 ACCTGCGTGA GGGTAGACTG GAAGTCATCC ATGTCCACAA AGCGGTGGTA TGCGCCCGTG 
10201 TTGATGGTGT AAGTGCAGTT GGCCATAACG GACCAGTTAA CGGTCTGGTG ACCCGGCTGC 
10261 GAGAGCTCGG TGTACCTGAG ACGCGAGTAA GCCCTCGAGT CAAATACGTA GTCGTTGCAA 
10321 GTCCGCACCA GGTACTGGTA TCCCACCAAA AAGTGCGGCG GOGGCTGGCG GTAGAGGGGC 
10381 CAGCGTAGGG TGGCCGGGGC TCCGGGGGCG AGATCTTCCA ACATAAGGCG ATGATATCCG 
10441 TAGATGTACC TGGACATCCA GGTGATGCCG GCGGCGGTGG TGGAGGCGCG CGGAAAGTCG 
10501 CGGACGCGGT TCCAGATGTT GCGCAGCGGC AAAAAGTGCT CCATGGTCGG GACGCTCTGG 
10561 CCGGTCAGGC GCGCGCAATC GTTGACGCTC TAGACCGTGC AAAAGGAGAG CCTGTAAGCG 
10621 GGCACTCTTC CGTGGTCTGG TGGATAAATT CGGAAGGGTA TCATGGCGGA CGACCGGGGT 
10681 TCGAGCCCCG TATCCGGCCG TCCGCCGTGA TCCATGCGGT TACCGCCCGC GTGTCGAACC 
10741 CAGGTGTGCG ACGTCAGACA ACGGGGGAGT GCTCCTTTTG GCTTCCTTCC AGGCGCGGCG 
10801 GCTGCTGCGC TAGCTTTTTT GGCCACTGGC CGCGCGCAGC GTAAGCGGTT AGGCTGGAAA 
10861 GCGAAAGCAT TAAGTGGCTC GCTCCCTGTA GCCGOAGGGT TATTTTCCAA GGGTT GAGTC 
10921 GCGGGACCCC CGGTTCGAGT CTCX5GACCGG CCGGACTGCG GCGAACGGGG GTTTGCCTCC 
10981 CCGTCATGCA AGACCCCGCT TGCAAATTCC TCCGGAAACA GGGACGAGCC CCTTTTTTGC 
11041 TTTTCCCAGA TGCATCCGGT GCTGCGGCAG ATGCGCCCCC CTCCTCAGCA GCGGCAAGAG 
11101 CAAGAGCAGC GGCAGACATG CAGGGCACCC TCCCCTCCTC CTACCGCGTC AGGAGGGGCG 
11161 ACATCOGCGG TTGACGCGGC AGCAGATGGT GATTACGAAC CCCCGOGGCG CCGGGCCCGG 
11221 CACTACCTGG ACTTGGAGGA GGGCGAGGGC CTGGCGCGGC TAGGAGGGCC CTCTCCTGAG 
11281 CGGTACCCAA GGGTGCAGCT GAAGCGTGAT ACGCGTGAGG CGTACGTGCC GCGGCAGAAC 
11341 CTGTTTCGCG ACCGCGAGGG AGAGGAGCCC GAGGAGATGC GGGATCGAAA GTT CCACG CA 
11401 GGGCGCGAGC TGCGGCATGG CCTGAATCGC GAGCGGTTGC TGCGCGAGGA GGACTTTGAG 
11461 CCCGACGCGC GAACCGGGAT TAGTCCCGCG CGCGCACACG TGGOGGCCGC CGACCTGGTA 
11521 ACCGCATACG AGCAGACGGT GAACCAGGAG ATTAACTTTC AAAAAAGCTT TAACAACCAC 
11581 GTGCGTACGC TTGTGGCGCG CGAGGAGGTG GCTATAGGAC TGATGCATCT GTGGGACTTT 
11641 GTAAGCGCGC TGGAGCAAAA CCCAAATAGC AAGCCGCTCA TGGCGCAGCT GTTCCTTATA 
11701 GTGCAGCACA GCAGGGACAA CGAGGCATTC AGGGATGCGC TGCTAAACAT AGTAGAGCCC 
11761 GAGGGCCGCT GGCTGCTCGA TTTGATAAAC ATCCTGCAGA GCATAGTGGT GCAGGAGCGC 
11821 AGCTTGAGCC TGGCTGACAA GGTGGCCGCC ATCAACTATT CCATGCTTAG CCTGGGCAAG 
11881 TTTTACGCCC GCAAGATATA CCATACCCCT TAOGTTCCCA TAGACAAGGA GGTAAAGATC 
11941 GAGGGGTTCT ACATOCGCAT GGCGCTGAAG GTGCTTACCT TGAGCGACGA CCTGGGCGTT 
12001 TATCGCAACG AGCGCATCCA CAAGGCCGTG AGCGTGAGCC GGCGGCGCGA GCTCAGCGAC 
12061 CGCGAGCTGA TGCACAGCCT GCAAAGGGCC CTGGCTGGCA CGGGCAGCGG CGATAGAGAG 
12121 GCCGAGTCCT ACTTTGACGC GGGCGCTGAC CTGCGCTGGG CCCCAAGCCG ACGCGCCCTG 
12181 GAGGCAGCTG GGGCCGGACC TGGGCTGGCG GTGGCACCCG CGCGCGCTGG CAACGTCGGC 
12241 GGCGTGGAGG AATATGACGA GOACGATGAG TACGAGCCAO AGGACGGCGA GTACTAAGCG 
12301 GTGATGTTTC TGATCAGATG ATGCAAGACG CAACGGACCC GGCGGTGCGG GOGGCGCTGC 
12361 AGAGCCAGCC GTCCGGCCTT AACTCCACGG ACGACTGGCG CCAGOTCATG GACCGCATCA 
12421 TGTCGCTGAC TGCGCGCAAT CCTGACGCGT TCCGGCAGCA GCCGCAGGCC AACCGGCTCT 
12481 CCGCAATTCT GGAAGCGGTG GTCCCGGCGC GCGCAAACCC CACGCACGAG AAGGTGCTGG 
12541 CGATCGTAAA CGCGCTGGCC GAAAACAGGG CCATCCGGCC OGACGAGGCC GGCCTGGTCT 
12601 ACGACGCGCT GCTTCAGCGC GTGGCTCGTT ACAACAGCGG CAACGTGCAG ACCAACCTGG 
12661 ACCGGCTGGT GGGGGATGTG CGOGAGGCOG TGGOGCAGCG TGAGCGCGCG CAGCAGCAGG 
12721 GCAACCTGGG CTCCATGGTT GCACTAAACG CCTTCCTGAG TACACAGCCC GCCAACGTGC 
12781 CGCGGGGACA GGAGGACTAC ACCAACTTTG TGAGCGCACT GCGGCTAATG GTGACTGAGA 
12841 CACCGCAAAG TGAGGTGTAC CAGTCTGGGC CAGACTATTT TTTCCAOACC AGTAGACAAG 
12901 GCCTGCAGAC CGTAAACCTG AGCCAGGCTT TCAAAAACTT GCAGGGGCTG TOGGGGGTGC 
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12961 GGGCTCCCAC AGGCGACCGC GCGACCGTGT CTAGCTTGCT GACGCCCAAC TCGCGCCTGT 
13021 TGCTGCTGCT AATAGCGCCC TTCACGGACA GTGGCAGCGT GTCCCGGGAC ACATACCTAG 
130 Bl GTCACTTGCT GACACTGTAC CGCGAGGCCA TAGGTCAGGC GCATGTGGAC GAGCATACTT 
13141 TCCAGGAGAT TACAAGTGTC AGCCGCGCGC TGGGGCAGGA GGACACGGGC AGCC TGGAG G 
13201 CAACCCTAAA CTACCTGCTG ACCAACCGGC GGCAGAAGAT CCCCTCGTTG CACAGTTTAA 
13261 ACAGCGAGGA GGAGCGCATT TTGCGCTACG TGCAGCAGAG CGTGAGCCTT AACCTGATGC 
13321 GOGACGGGGT AACGCCCAGC GTGGCGCTGG ACATGACCGC GCGCAACATG GAACCGGGCA 
13381 TGTATGCCTC AAACCGGCCG TTTATCAACC GCCTAATGGA CTACTTGCAT CGCGCGGCCG 
13441 CCGTCAACCC CGAGTATTTC ACCAATGCCA TCTTGAACCC GCACTGGCTA CCGCCCCCTG 
13501 GTTTCTACAC CGGGGGATTC GAGGTGCCCG AGGGTAACGA TGGATTCCTC TGGGACGACA 
13561 TAGACGACAG CGTGTTTTCC CCGCAACCGC AGACCCTGCT AGAGTTGCAA CAGCGCGAGC 
13621 AGGCAGAGGC GGCGCTGCGA AAGGAAAGCT TCCGCAGGCC AAGCAGCTTQ TCCGATCTAG 
13681 GCGCTGCGGC CCCGOGGTCA GATGCTAGTA GCCCATTTCC AAGCTTGATA GGQTCTCTTA 
13741 CCAGCACTCG CACCACCCGC CCGCGCCTGC TGGGCGAGGA GGAGTACCTA AACAACTCGC 
13801 TGCTGCAGCC GCAGCGOGAA AAAAACCTGC CTCCGGCATT TCCCAACAAC GGGATAGAGA 
13861 GCCTAGTGGA CAAGATOAGT AGATGGAAGA CGTACGCGCA GGAGCACAGG GACGTGCCAG 
13921 GCCCGCGCCC GCCCACCCGT CGTCAAAGGC ACGACCGTCA GCGGGGTCTG GTGTGGGAGG 
13981 ACGATGACTC GGCAGACGAC AGCAGCGTCC TGGATTTGGG AGGGAGTGGC AACCCGTTTG 
14041 CGCACCTTCG CCCCAGGCTQ GGGAGAATGT TTTAAAAAAA AAAAAGCATG ATGCAAAATA 
14101 AAAAACTCAC CAAGGCCATG GCACCGAGCG TTGGTTTTCT TGTATTCCCC TTAGTATGCG 
14161 GCGCGCGGCG ATGTATGAGG AAGGTCCTCC TCCCTCCTAC GAGAGTGTGG TGAGOGCGGC 
14221 GCCAGTGGCG GCGGCGCTGG GTTCTCCCTT CGATGCTCCC CTGGACCCGC CGTTTGTGCC 
14281 TCCGCGGTAC CTGCGGCCTA CCGGGGGGAG AAACAGCATC CGTTACTCTG AGTTGGCACC 
14341 CCTATTCGAC ACCACCCGTG TGTACCTGGT GGACAACAAG TCAACGGATG TGGCATCCCT 
14401 GAACTACCAG AACGACCACA GCAACTTTCT GACCACGGTC ATTGAAAACA ATGACTACAG 
14461 CCGGGGGGAG GCAAGCACAC AGACCATCAA TCTTGACGAC CGGTCGCACT GGGGCGGCGA 
14521 CCTGAAAACC ATCCTGCATA CCAACATGCC AAATGTGAAC GAGTTCATGT TTACCAATAA 
14581 GTTTAAGGCG CGGGTGATGG TGTCGCGCTT GCCTACTAAG GACAATCAGG TGGAGCTGAA 
14641 ATACGAGTGG GTGGAGTTCA CGCTGCCCGA GGGCAACTAC TCCGAGACCA TGACCATAGA 
14701 CCTTATGAAC AACGCGATCG TGGAGCACTA CTTGAAAGTG GGCAGACAGA ACGGGGTTCT 
14761 GGAAAGCGAC ATCGGGGTAA AGTTTGACAC CCGCAACTTC AGACTGGGGT TTGACCCCGT 
14821 CACTGGTCTT GTCATGCCTG GGGTATATAC AAACGAAGCC TTCCA TCCAG ACATCATTTT 
14881 GCTGCCAGGA TGCGGGGTGG ACTTCACCCA CAGCCGCCTG AGCAACTTGT TGGGCATCCG 
14941 CAAGCGGCAA CCCTTCCAGG AGGGCTTTAG GATCACCTAC GATGATCTGG AGGGTGGTAA 
15001 CATTCCCGCA CTGTTGGATG TGGACGCCTA CCAGGCGAGC TTGAAAGATG ACAOCGAACA 
15061 GGGCGGGGGT GGCGCAGGCG GCAGCAACAG CAGTGGCAGC GGCGCGGAAG AGAACTCCAA 
15121 CGCGGCAGCC GCGGCAATGC AGCCGGTGGA GGACATGAAC GATCATGCCA TTCGCGGCGA 
15181 CACCTTTGCC ACACGGGCTG AGGAGAAGCG CGCTOAGGCC GAAGCAGCGG CCOAAGCTGC 
15241 CGCCCCCGCT GCGCAACCCG AGGTCGAGAA GCCTCAGAAG AAACCGGTOA TCAAACCCCT 
15301 GACAGAGGAC AGCAAGAAAC GCAGTTACAA CCTAATAAGC AATGACAGCA CCTTCACCCA 
15361 GTACCGCAGC TGGTACCTTG CATACAACTA CGGCGACCCT CAGACCGGAA TCCGCTCATG 
15421 GACCCTGCTT TGCACTCCTG ACGTAACCTG CGGCTCGGAG CAGGTCTACT GGTCGTTGCC 
15481 AGACATGATG CAAGACCCCG TGACCTTCCG CTCCACGCGC CAGATCAGCA ACTTTCCGGT 
15541 GOTGGGCGCC GAGCTGTTGC CCGTGCACTC CAAGAGCTTC TACAACGACC AGGCCGTCTA 
15601 CTCCCAACTC ATCCGCCAGT TTACCTCTCT GACCCACGTG TTCAATCGCT TTCCCGAGAA 
15661 CCAGATTTTG GCGCGCCCGC CAGCCCCCAC CATCACCACC GTCAGTGAAA ACOTTCCTGC 
15721 TCTCACAGAT CACGGGACGC TACCGCTGCG CAACAGCATC GGAGGAGTCC AGCQAGTOAC 
15781 CATTACTGAC GCCAGACGCC GCACCTGCCC CTACGTTTAC AAGGCCCTGG GCATAGTCTC 
15841 GCCGCGCQTC CTATCGAGCC GCACTTTTTG AGCAAGCATG TCCATCCTTA TATCGCCCAG 
15901 CAATAACACA GGCTGGGGCC TGCGCTTCCC AAGCAAGATG TTTGGCGGGG CCAAGAAOCO 
15961 CTCCGACCAA CACCCAGTGC GCGTGCGCGG GCACTACOGC GCGCCCTGGG GCGCGCACAA 
16021 ACGCGGCCGC ACTGGGCGCA CCACCGTCGA TGACGCCATC GACGCGGTGG TGGAGGAGGC 
16081 GCGCAACTAC ACGCCCACGC CGCCACCAGT GTCCACAGTG GACGCGGCCA TTCAGACCOT 
16141 GGTGCGCGGA GCCCGGCGCT ATGCTAAAAT GAAGAGACGO CGGAGGCGCG TAGCACGTCG 
16201 CCACCGCOGC COACCCGGCA CTGCOGCCCA ACGCGCGGCG GCGGCCCTGC TTAACCGCGC 
16261 ACGTCGCACC GGCCGACGGG CGGCCATGCG GGCCGCTCGA AGGCTGGCCG CGGGTATTGT 
16321 CACTGTGCCC CCCAGGTCCA GGCOACGAGC GGCCGCCGCA GCAGCCGCGG CCATTAGTGC 
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16381 TATGACTCAG GGTCGCAGGG GCAACQTGTA TTGGGTGCGC GACTCGGTTA GCGGCCTGCG 
16441 CGTGCCCGTG CGCACCCGCC CCCCGCGCAA CTAGATTGCA AGAAAAAACT ACTTAGACTC 
16501 GTACTGTTGT ATGTATCCAG CGGCGGCGGC GCGCAACGAA GCTATGTCCA AGCGCAAAAT 
16561 CAAAGAAGAG ATGCTCCAGG TCATCGCGCC GGAGATCTAT GGCCCCCCGA AGAAGGAAGA 
16621 GCAGGATTAC AAGCCCCGAA AGCTAAAGCG GGTCAAAAAG AAAAAGAAAG ATGATGATGA 
16681 TGAACTTGAC GACGAGGTGG AACTGCTGCA CGCTACCGCG CCCAGGCGAC GGGTACAGTG 
16741 GAAAGGTCGA CGCGTAAAAC GTGTTTTGCG ACCCGGCACC ACCGTAGTCT TTACGCCCGG 
16801 TGAGCGCTCC ACCCGCACCT ACAAGCGCGT GTATGATGAG GTGTACGGCG ACGAGGACCT 
16861 GCTTGAGCAG GCCAACGAGC GCCTCGGGGA GTTTGCCTAC GGAAAGCGGC ATAAGGACAT 
16921 GCTGGCGTTG CCGCTGGACG AGGGCAACCC AACACCTAGC CTAAAGCCCG TAACACTGCA 
16981 GCAGGTGCTG CCCGCGCTTG CACCGTCCGA AGAAAAGCGC GGCCTAAAGC GCGAGTCTGG 
17041 TGACTTGGCA CCCACCGTGC AGCTGATGGT ACCCAAGCGC CAGCGACTGG AAGATGTCTT 
17101 GGAAAAAATG ACCGTGGAAC CTGGGCTGGA GCCCGAGGTC CGCGTGCGGC CAATCAAGCA 
17161 GGTGGCGCCG GGACTGGGCG TGCAGACCGT GGACGTTCAG ATACCCACTA CCAGTAGCAC 
17221 CAGTATTGCC ACCGCCACAG AGGGCATGGA GACACAAACG TCCCCGGTTG CCTCAGCGGT 
17281 GGCGGATGCC GCGGTGCAGG CGGTCGCTGC GGCCGCGTCC AAGACCTCTA CGGAGGTGCA 
17341 AACGGACCCG TGGATGTTTC GCGTTTCAGC CCCdCGGCGC CCGCGCGGTT CGAGGAAGTA 
17401 CGGCGCCGCC AGCGCGCTAC TGCCCGAATA TGCCCTACAT CCTTCCATTG CGCCTACCCC 
17461 CGGCTATCGT GGCTACACCT ACCGCCCCAG AAGACGAGCA ACTACCCGAC GCCOAACCAC 
17521 CACTGGAACC CGCCGCCGCC GTCGCCGTCG CCAGCCCGTG CTGGCCCCGA TTTCCGTGCG 
17581 CAGGGTGGCT CGCGAAGGAG GCAGGACCCT GGTGCTGCCA ACAGCGCGCT ACCACCCCAG 
17641 CATCGTTTAA AAGCCGGTCT TTGTGGTTCT TGCAGATATG GCCCTCACCT GCCGCCTCCG 
17701 TTTCCCGGTG CCGGGATTCC GAGGAAGAAT GCACCGTAGG AGGGGCATGG CCGGCCACGG 
17761 CCTGACGGGC GGCATGCGTC GTGCGCACCA CCGGCGGCGG CGCGCGTCGC ACCGTCGCAT 
17821 GCGCGGCGGT ATCCTGCCCC TCCTTATTCC ACTGATCGCC GCGGCGATTG GCGCCGTGCC 
17881 CGGAATTGCA TCCGTGGCCT TGCAGGCGCA GAGACACTGA TTAAAAACAA GTTGCATGTG 
17941 GAAAAATCAA AATAAAAAGT CTGGACTCTC ACGCTOGCTT GGTCCTGTAA CTATTTTGTA 
18001 GAATGGAAGA CATCAACTTT GCGTCTCTGG CCCCGOGACA CGGCTCGCGC CCGTTCATGG 
18061 GAAACTGGCA AGATATCGGC ACCAGCAATA TGAGCGGTGG CGCCTTCAGC TGGGGCTCGC 
18121 TGTGGAGCGG CATTAAAAAT TTCGGTTCCA CCGTTAAGAA CTATGGCAGC AAGGCCTGGA 
18181 ACAGCAGCAC AGGCCAGATG CTGAGGGATA AGTTGAAAGA GCAAAATTTC CAACAAAAGG 
18241 TGGTAGATGG CCTGGCCTCT GGCATTAGCG GGGTGGTCGA CCTGGCCAAC CAGGCAGTGC 
18301 AAAATAAGAT TAACAGTAAG CTTGATCCCC GCCCTCCCGT AGAGGAGCCT CCACOGGCCG 
18361 TGGAGACAGT GTCTCCAGAG GGGCGTGGCG AAAAGCGTCC GCGCCCCGAC AGGGAAGAAA 
18421 CTCTGGTGAC GCAAATAGAC GAGCCTCCCT CGTACGAGGA GGCACTAAAG CAAGGCCTGC 
18481 CCACCACCCG TCCCATCGCG CCCATGGCTA CCGGAGTGCT GGGCCAGCAC ACACCCGTAA 
18541 CGCTGGACCT GCCTCCCCCC GCCGACACCC AGCAGAAACC TGTGCTGCCA GGCCCGACCG 
18601 CCGTTGTTGT AACCCGTCCT AGCCGCGCGT CCCTGCGCCG CGCCGCCAGC GGTCCGCGAT 
18661 CGTTGCGGCC CGTAGCCAGT GGCAACTGGC AAAGCACACT GAACAGCATC GTGGGTCTGG 
18721 GGGTGCAATC CCTGAAGCGC CGACGATGCT TCTGAATAGC TAACGTGTCG TATGTGTGTC 
18781 ATGTATGCGT CCATGTCGCC GCCAGAGGAG CTGCTGAGCC GCCGCGCGCC CGCTTTCCAA 
18841 GATGGCTACC CCTTCGATGA TGCCGCAGTG GTCTTACATG CACATCTCGG GCCAGGACGC 
18901 CTCGGAGTAC CTOAGCCCCG GGCTGGTGCA GTTTGCCCGC GCCACCGAGA CGTACTTCAG 
18961 CCTGAATAAC AAGTTTAGAA ACCCCACGGT GGOGCCTACG CACGACGTGA CCACAGACCG 
19021 GTCCCAGCGT TTGACGCTGC GGTTCATCCC TGTGGACCGT GAGGATACTG CGTACTCGTA 
190B1 CAAGGCGCGG TTCACCCTAG CTGTGGGTGA TAACCGTGTG CTGGACATGG CTTCCACGTA 
19141 CTTTGACATC CGCGGCGTGC TGGACAGGGG CCCTACTTTT AAGCCCTACT CTGGCACTGC 
19201 CTACAAOGCC CTGOCTCCCA AGGGTGCCCC AAATCCTTGC GAATGGGATG AAGCTGCTAC 
19261 TGCTCTTGAA ATAAACCTAG AAGAAGAGGA CGATGACAAC GAAGACGAAG TAGACGAGCA 
19321 AGCTGAGCAG CAAAAAACTC ACGTATTTGG GCAGGCGCCT TATTCTGGTA TAAATATTAC 
19381 AAAGGAGGGT ATTCAAATAG GTGTCGAAGG TCAAACACCT AAATATGCCG ATAAAACATT 
19441 TCAACCTGAA CCTCAAATAG GAGAATCTCA GTGGTACGAA ACTGAAATTA ATCATGCAGC 
19501 TGGGAGAGTC CTTAAAAAGA CTACCCCAAT GAAACCATGT TACGGTTCAT ATGCAAAACC 
19561 CACAAATGAA AATGGAGGGC AAGGCATTCT TGTAAAGCAA CAAAATGGAA AGCEAGAAAG 
19621 TCAAGTGGAA ATGCAATTTT TCTCAACTAC TGAGGOGACC GCAGGCAATG GTGATAACTT 
19681 GACTCCTAAA GTGOTATTGT ACAGTGAAGA TGTAGATATA GAAACCCCAG ACACTCATAT 
19741 TTCTTACATG CCCACTATTA AGGAAGGTAA CTCACGAGAA CTAATGGGCC AACAATCTAT 
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19801 GCCCAACAGG CCTAATTACA TTK3CTTTTAG GGACAATTTT ATTGGTCTAA TGTATTACAA 
19861 CAGCACGGGT AATATGGGTG TTCTGGOGGG CCAAGCATCG CAGTTGAATG CTGTTGTAGA 
19921 TTTGCAAGAC AQAAACACAG AGCTTTCATA CCAGCTTTTG CTTCATTCCA TTGGTQATAG 
19981 AACCAGQTAC TTTTCTATGT GGAATCAGGC TOTTGACAGC TATGATCCAO ATGTTAGAAT 
20041 TATTQAAAAT CATGGAACTG AAOATGAACT TCCAAATTAC TGCTTTCCAC TGGOAGGTQT 
20101 GATTAATACA GAGACTCTTA CCAAGGTAAA ACCTAAAACA GGTCAGGAAA ATGGATGGGA 
20161 AAAAGATGCT ACAOAATTTT CAGATAAAAA TGAAATAAGA GTTGGAAATA ATTTTGCCAT 
20221 GGAAATCAAT CTAAATGCCA ACCTGTGGAG AAATTTCCTG TACTCCAACA TAGCGCTGTA 
20281 TTTGCCOGAC AAGCTAAAGT ACAGTCCTTC CAACGTAAAA ATTTCTGATA ACCCAAACAC 
20341 CTACGACTAC ATGAACAAGC GAGTGGTGGC TCCCGGGTTA GTGGACTGCT ACATTAACCT 
20401 TGGAGCACGC TGGTCCCTTG ACTATATGGA CAACGTCAAC CCATTTAACC ACCACCGCAA 
20461 TGCTGGCCTG CGCTACCGCT CAATGTTGCT GGGCAATGGT CGCTATGTGC CCTTCCACAT 
20521 CCAGGTCCCT CAGAAGTTCT TTGCCATTAA AAACCTCCTT CTCCTGCCGG GCTCATACAC 
20581 CTACGAGTGG AACTTCAGGA AGGATGTTAA CATGGTTCTG CAGAGCTCCC TAGGAAATGA 
20641 CCXAAGGGTT GACGGAGCCA GCATTAAOTT TGATAGCATT TGCCTTTACG CCACCTTCTT 
20701 CCCCATGGCC CACAACACCG CCTCCACGCT TGAGGCCATG CTTAGAAACG ACACCAACOA 
20761 CCAGTCCTTT AACGACTATC TCTCCGCCGC CAACATGCTC TACCCTATAC CCGCCAACGC 
20821 TACCAACGTG CCCATATCCA TCCCCTCCCG CAACTGGGCG GCTTTCOGCG GCTGGGCCTT 
20881 CACGCGCCTT AAGACTAAGG AAACCCCATC ACTGGGCTCG GGCTACGACC CTTATTACAC 
20941 CTACTCTGGC TCTATACCCT ACCTAGATGG AACCTTTTAC CTCAACCACA CCTTTAAQAA 
21001 GGTGGCCATT ACCTTTGACT CTTCTGTCAG CTGGCCTGGC AATGACCGCC TGCTTACCCC 
21061 CAACGAGTTT GAAATTAAGC GCTCAGTTGA CGGGGAGGGT TACAACGTTG CCCAGTGTAA 
21121 CATGACCAAA GACTGGTTCC TGGTACAAAT GCTAGCTAAC TACAACATTG GCTACCAGGG 
21181 CTTCTATATC CCAGAGAGCT ACAAGGACCG CATGTACTCC TTCTTTAGAA ACTTCCAGCC 
21241 CATGAGCCGT CAGGTGGTGG ATGATACTAA ATACAAGGAC TACCAACAGG TGGGCATCCT 
21301 ACACCAACAC AACAACTCTG GATTTGTTGG CTACCTTGCC CCCACCATGC GCGAAGGACA 
21361 GGCCTACCCT GCTAACTTCC CCTATCCGCT TATAGGCAAG ACCGCAGTTG ACAGCATTAC 
21421 CCAGAAAAAG TTTCTTTGCG ATCGCACCCT TTGGCGCATC CCATTCTCCA GTAACTTTAT 
21481 GTCCATGGGC GCACTCACAG ACCTGGGCCA AAACCTTCTC TACGCCAACT CCGCCCACGC 
21541 GCTAGACATG ACTTTTGAGG TGGATCCCAT GGACGAGCCC ACCCTTCTTT ATGTTTTGTT 
21601 TGAAGTCTTT GACGTGGTCC GTGTGCACCG GCCGCACCGC GGCGTCATCG AAACCGTGTA 
21661 CCTGCGCACG CCCTTCTCGG CCGGCAACGC CACAACATAA AGAAGCAAGC AACATCAACA 
21721 ACAGCTGCCG CCATGGGCTC CAGTGAGCAG GAACTGAAAG CCAT TGTCA A AGATCTTGGT 
21781 TGTGGGCCAT A TTTTTTG GG CACCTATGAC AAGCGCTTTC CAGGCTTTGT TTCTCCACAC 
21841 AAGCTCGCCT GCGCCATAGT CAATACGGCC GGTCGCGAGA CTGGGGGCGT ACA CTGGATG 
21901 GCCXTTGCCT GGAACCCGCA CTCAAAAACA TGCTACCTCT TTGAGCCCTT TGGCTTTTCT 
21961 GACCAGCGAC TCAAGCAGGT TTACCAGTTT GAGTACGAGT CACTCCTGCG COGTAGCGCC 
22021 ATTGCTTCTT CCCCCGACCG CTGTATAACG CTGGAAAAGT CCACCCAAAG CGTACAGGGG 
22081 CCCAACTCGG CCGCCTGTGG ACTATTCTGC TGCATGTTTC TCCACGCCTT TGCCAACTGG 
22141 CCCCAAACTC CCATGGATCA CAACCCCACC ATGAACCTTA TTACCGGGGT ACCCAACTCC 
22201 ATGCTCAACA GTCCCCAGOT ACAGCCCACC CTGCGTCGCA ACCAGGAACA GCTCTACAGC 
22261 TTCCTGGAGC GCCACTCGCC CTACTTCCGC AGCCACAGTG CGCAGATTAQ GAGCGCCACT 
22321 TCTTTTTOT C ACTTGAAAAA CATGTAAAAA TAATGTACTA OAGACACTTT CAATAAAGGC 
22381 AAATGCTTTT ATTTGTACAC TCTCGGGTGA TTATTTACCC CCACCCTTGC OGTCTGCGCC 
22441 GTTTAAAAAT CAAAGGGGTT CTGCCGCGCA TCGCTATGCG CCACTGGCAG GGACAOGTTG 
22501 CGATACTCGT GTTTAGTGCT CCACTTAAAC TCAGGCACAA CCATCCGCGG CAGCTCGGTG 
22561 AAGTTTTCAC TCCACAGGCT GCGCACCATC ACCAACGCGT TTAGCAGGTC GGGOGCCGAT 
22621 ATCTTOAAGT CGCAOTTGGG GCCTCCGCCC TGCGCGCGCG AGTTGCGATA CACAGGGTTG 
22681 CAOCACTGGA ACACTATCAG CGCCGGGTGG TGCACGCTGG CCAGCACGCT CTTGT OGQAQ 
22741 ATCAGATCCG CGTCCAGGTC CTCCGCGTTG CTCAGGGCGA ACGOAOTCAA CTTTGGTAGC 
22801 TOC CTTCCCA AAAAGGGCGC GTGCCCAGGC TTTGAGTTGC ACTCGCACCG TAGTGGCATC 
22861 AAAAGGTGAC CGTGCCCGGT CTGGGCGTTA GGATACAGCG CCTGCATAAA AGC CTTGA TC 
22921 TGCTTAAAAG CCACCTGAGC CTTTGCGCCT TCAGAGAAGA ACATGCCGCA AGACTTGCCG 
22981 GAAAACTGAT TGGCCGGACA GGCCGCGTCG TGCAOGCAGC ACCTTGCGTC GQTQTTGQAG 
23041 ATCTGCACCA CATTTCGGCC CCACCGGTTC TTCAOGATCT TGGCCTTGCT AQACTGC TCC 
23101 TTCAGCGCGC GCTGCCCGTT TTCGCTOGTC ACATCCATTT CAATCACGTG CTCCTTATTT 
23161 ATCATAATGC TTCCGTGTAG ACACTTAAGC TCGCCTTCGA TCTCAGOGCA GCGGTGCAGC 
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23221 CACAACGCGC AGCCCGTGGG CTCGTGATGC TTGTAGGTCA CCTCTGCAAA CGACTGCAGG 
23281 TACGCCTGCA GGAATCGCCC CATCATCGTC ACAAAGOTCT TGTTGCTGGT GAAGGTCAGC 
23341 TGCAACCCGC GGTGCTCCTC GTTCAGCCAG OTCTTGCATA CGGCCGCCAG AGCTTCCACT 
23401 TGGTCAGGCA GTAGTTTGAA GTTCGCCTTT AGATCGTTAT CCACGTGGTA CTTQTCCATC 
23461 AGCGCGCGCG CAGCCTCCAT GCCCTTCTCC CACGCAGACA CGATCGGCAC ACTCAGCGGG 
23521 TTCATCACCG TAATTTCACT TTCCGCTTCG CTGGGCTCTT CCTCTTCCTC TTGCGTCCGC 
23581 ATACCACGCG CCACTGGGTC GTCTTCATTC AGCCGCCGCA CTGTGC GCTT ACCTCCTTTG 
23641 CCATGCTTGA TTAGCACCGG TGGGTT G CT G AAACCCACCA TTTOTAGCGC CACATCTTCT 
23701 CTTTCTTCCT CGCTOTCCAC GATTACCTCT GGTGATGGCG GGCGCTCGGG CTTGGGAGAA 
23761 GGGCGCTTCT TTTTCTTCTT GGGCGCAATG GCCAAATCCG COGCCGAGGT COATGGCCGC 
23821 GGGCTGGGTG TGCGCGGCAC CAGCGCGTCT TGTGATGAGT CTTCCTCGTC CTCGGACTCG 
23881 ATACGCCGCC TCATCCGCTT TTTTGGGGGC GCCCGGGGAG GCGGCGGCOA CGGGGACGGG 
23941 GACOACACGT CCTCCATGGT TGGGGGACGT CGCGCCGCAC CGCGTCCGCG CTCGGGGGTG 
24001 GTTTCGCGCT GCTCCTCTTC CCGACTGGCC ATTTCCTTCT CCTATAGGCA GAAAAAGATC 
24061 ATGGAGTCAG TCGAGAAGAA GGACAGCCTA ACCGCCCCCT CTGAGTTCGC CACCACCGCC 
24121 TCCACCGATG CCGCCAACGC GCCTACCACC TTCCCCGTCG AGGCACCCCC GCTTGAGGAG 
24181 GAGGAAGTGA TTATCGAGCA GGACCCAGGT TTTGTAAGCG AAGACGACGA GGACCGCTCA 
24241 GTACCAACAG AGGATAAAAA GCAAGACCAG GACAACGCAG AGGCAAACGA G GAACAA QTC 
24301 GGGCGGGGGG ACGAAAGGCA TGGCGACTAC CTAGATOTGG GAGACGACGT GCTGTTGAAG 
24361 CATCTGCAGC GCCAGTGCGC CATTATCTGC GACGCGTTGC AAGAGCGCAG CGATGTGCCC 
24421 CTCGCCATAG CGGATGTCAG CCTTGCCTAC GAACGCCACC TATTCTCACC GCGCGTACCC 
24481 CCCAAACGCC AAGAAAACGG CACATGOGAG CCCAACCCGC GCCTCAACTT CTACCCCGTA 
24541 TTTGCCGTGC CAGAGGTGCT TGCCACCTAT CACATCTTTT TCCAAAACTG CAAGATACCC 
24601 CTATCCTGCC GTGCCAACCG CAGCCGAGCG GACAAGCAGC TGGCCTTGCG GCAGGGCGCT 
24661 GTCATACCTG ATATCGCCTC GCTCAACGAA GTGCCAAAAA TCTTTGAGGG TCTTGGACGC 
24721 GACGAGAAGC GCGOGGCAAA CGCTCTGCAA CAGGAAAACA GCGAAAATGA AAGTCACTCT 
24781 GGAGTGTTGG TGGAACTCGA GGGTGACAAC GCGCGCCTAG CCGTACTAAA ACGCAGCATC 
24841 GAGGTCACCC ACTTTGCCTA CCCGGCACTT AACCTACCCC CCAAGGTCAT GAGC ACAGTC 
24901 ATGAGTGAGC TGATCGTGCG CCGTGCGCAG CCCCTGGAGA GGGATGCAAA TTTGCAAGAA 
24961 CAAACAGAGG AGGGCCTACC CGCAGTTGGC GACGAGCAGC TAGCGCGCTG GCTTCAAACG 
25021 CGCGAGCCTG CCGACTTGGA GGAGCGACGC AAACTAATGA TGGCCGCAGT GCTCGTTACC 
25081 GTGGAGCTTG AGTGCATGCA GCG GTTCTT T GCTGACCCGG AGATGCAGCG CAAGCTAGAG 
25141 GAAACATTGC ACTACACCTT TCGACAGGGC TACGTACGCC AGGCCTGCAA GATCTCCAAC 
25201 GTGGAGCTCT GCAACCTGGT CTCCTACCTT GGAATTTTGC ACGAAAACCG CCTTGGGCAA 
25261 AACGTGCTTC ATTCCACGCT CAAGGGCGAG GCGCGCCGCG ACTACGTCCG CGACTGCGTT 
25321 TACTTATTTC TATGCTACAC CTGGCAGACG GCCATGGGCG TTTGGCAGCA GTGCTTGGAG 
25381 GAGTGCAACC TCAAGGAGCT GCAGAAACTG CTAAAGCAAA ACTTGAAGGA CCTATGGACG 
25441 GCCTTCAACG AGCGCTCCGT GGCCGCGCAC CTGGCGGACA TCATTTTCCC CGAACGCCTG 
25501 CTTAAAACCC TGCAACAGGG TCTGCCAGAC TTCACCAGTC AAAGCATGTT QCAGAACTTT 
25561 AGGAACTTTA TCCTAGAGCG CTCAGGAATC TTGCCCGCCA CCTGCTOTGC ACTTCCTAGC 
25621 GACTTTGTGC CCATTAAGTA CCGCGAATGC CCTCGGCCGC TTTGGGGCCA CTGCTACCTT 
25681 CTGCAGCTAG CCAACTACCT TGCCTACCAC TCTGACATAA TGGAAGACGT GAGCGGTGAC 
25741 GGTCTACTGG AGTGTCACTG TCGCTGCAAC CTATGCACCC COCACCGCTC CCTGGTTTGC 
25801 AATTCGCAGC TGCTTAACGA AAGTCAAATT ATCGGTACCT TTGAGCTOCA GGGTCCCTCG 
25861 CCTGAOGAAA AGTCCGCGGC TCCGGGGTTG AAACTCACTC CGGGGCTGTG GACGTCGGCT 
25921 TACCTTOGCA AATTTGTACC TGAGGACTAC CACGCCCACG AGATTAGGTT CTACGAAGAC 
25981 CAATCCCGCC CGCCAAATGC GGAGCTTACC GCCTOCGTCA TTACCCAGGG CCACATTCTT 
26041 GGCCAATTGC AAGCCATCAA CAAAGCCCGC CAAGAGTTTC TGCTACGAAA GGGACGGGGG 
26101 GTTTACTTGG ACCCCCAGTC CGGCGAGGAG CTCAACCCAA TCCCCCCGCC GCCGCAGCCC 
26X61 TATCAGCAGC AGCCGCGGGC CCTTGCTTCC CAGGATGGCA CCCAAAAAGA AG CTGCAG CT 
26221 GCCGCCGCCA CCCACGGACG AGGAGGAATA CTGGOACAOT CAGGCAGAGG MGTTTTGGA 
26281 CGAGGAGGAG GAGGACATGA TGGAAGACTG GGAGAGCCTA GAOGAGGAAG CTTCCGAGGT 
26341 CGAAGAGGTG TCAGACGAAA CACCGTCACC CTOGGTCGCA TTCCCCTCGC CGGCGCCCCA 
26401 GAAATCGGCA ACCGOTTCCA GCATGGCTAC AACCTCCGCT CCTCAGGCGC CGCCGGCACT 
26461 GCCCGTTCGC CGACCCAACC GTAGATGGGA CACCACTGGA ACCAGGGCCG GTAAGTCCAA 
26521 GCAGCCGCCG CCOTTAGCCC AAGAGCAACA ACAGCGCCAA GGCTACCGCT CATGGOGOGG 
26581 GCACAAGAAC GCCATAGTTG CTTGCTTGCA AGACTGTGGG GGCAACATCT CCTTCGCCCG 
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26641 



CTCTACCATC AGGGCGTGGC CTTCCCCCGT AACATCCTGC ATTACTACCG 



26701 TCATCTCTAC AGCCCATACT GCACCGGCGG CAGCGGCAGC GGCAGCAACA GCAGCGGCCA 
26761 CACAGAAGCA AAGGCGACCG QATAGCAAGA CTCTGACAAA GCCCAAGAAA TCCACAGCGG 
26821 CGGCAGCAGC AGGAGGAGGA GCGCTGCGTC TGGCGCCCAA COAACCCGTA TCGACCCGCG 
26881 AGCTTAGAAA CAGQATTTTT CCCACTCTGT ATGCTATATT TCAACAGAGC AGGGGCCAAG 
26941 AACAAGAGCT GAAAATAAAA AACAGGTCTC TGCOATCCCT CACCCGCAGC TGCCTGTATC 
27001 ACAAAAGOGA AGATCAGCTT CGGCGCACGC TGGAAGACGC GGAGGCTCTC TTCAGTAAAT 
27061 ACTGCGCGCT GACTCTTAAG GACTAGTTTC GCGCCCTTTC TCAAATTTAA GCGCGAAAAC 
27121 TACGTCATCT CCAGCGGCCA CACCCGGCGC CAGCACCTGT CGTCAGCGCC ATTATGAGCA 
27181 AGGAAATTCC CACGCCCTAC ATGTGGAGTT ACCAGCCACA AATGGGACTT GCGGCTGGAG 
27241 CTGCCCAAGA CTACTCAACC CGAATAAACT ACATGAGCGC GGGACCCCAC ATGATATCCC 
27301 GGGTCAACGG AATCCGCGCC CACCGAAACC GAATTCTCTT GGAACAGGCG GCTATTACCA 
27361 CCACACCTCG TAATAACCTT AATCCCCGTA GTTGGCCCGC TGCCCTGGTG TACCAGGAAA 
27421 GTCCCGCTCC CACCACTGTG GTACTTCCCA GAGACGCCCA GGCCGAAGTT CAGATGACTA 
27481 ACTCAGGGGC GCAGCTTGCG GGOGGCTTTC GTCACAGGGT GCGGTCGCCC GGGCAGGGTA 
27541 TAACTCACCT GAGAATCAGA GGGCQAGGTA TTCAGCTCAA CGACGAGTCG GTOAGCTCCT 
27601 CGCTTGGTCT COGTCOGGAC GGGACATTTC AGATCGGCGG CGCCGGCCGT CCTTCATTCA 
27661 CGCCTCGTCA GGCAATCCTA ACTCTGCAGA CCTCGTCCTC TGAGCCGCGC TCTGGAGGCA 
27721 TTGGAACTCT GCAATTTATT GAGGAGTTTG TGCCATCGGT CTACTTTAAC CCCTTCTCGG 
27781 GACCTCCOGG CCACTATCOG GATCAATTTA TTCCTAACTT TGACGCGGTA AAGGACTCGG 
27841 CGGACGGCTA CGACTGAATG TTAAGTGGAG AGGCAGAGCA ACTGCGCCTG AAACACCTGG 
27901 TCCACTGTCG CCGCCACAAG TGCTTTGCCC GCGACTCCGG TGAGTTTTGC TACTTTGAAT 
27961 TGCCCGAGGA TCATATCGAG GGCCCGGCGC ACGGCGTCCG GCTTACCGCC CAGGGAGAGC 
28021 TTGCCCGTAG CCTGATTOGG GAGTTTACCC AGCGCCCCCT GCTAGTTGAG CGGGACAGGG 
28081 GACCCTGTGT TCTCACTGTG ATTTGCAACT GTCCTAACCT TGGATTACAT CAAGATCTTT 
28141 GTTGCCATCT CTGTGCTGAG TATAATAAAT ACAGAAATTA AAATATACTG GGGCTCCTAT 
28201 CGCCATCCTG TAAACGCCAC OGTCTTCACC CGCCCAAGCA AACCAAGGCG AACCTTACCT 
28261 GGTACTTTTA ACATCTCTCC CTCTGTGATT TACAACAGTT TCAACCCAGA CGGAGTGAGT 
28321 CTACGAGAGA ACCTCTCCGA GCTCAGCTAC TCCATCAGAA AAAACACCAC CCTCCTTACC 
28381 TOCCGGGAAC GTACGAGTGC GTCACCGGCC GCTGCACCAC ACCTACCGCC TGACCGTAAA 
28441 CCAGACTTTT TCOGGACAGA CCTCAATAAC TCTGTTTACC AGAACAGGAG GTGAGCTTAG 
28501 AAAACCCTTA GGGTATTAGG CCAAAGGCGC AGCTACTGTG GGGTTTATGA ACAATTCAAG 
28561 CAACTCTACG GGCTATTCTA ATTCAGGTTT CTCTAGAATC GGGGTTGGGG TTATTCTCTG 
28621 TCTTGTGATT CTCTTTATTC TTATACTAAC GCTTCTCTGC CTAAGGCTCG CCX3CCTGCTG 
28681 TGTGCACATT TGCATTTATT GTCAGCTTTT TAAACGCTGG GGTCGCCACC CAAGATGATT 
28741 AGGTACATAA TCCTAGGTTT ACTCACCCTT GCGTCAGCCC ACGGTACCAC CCAAAAGGTG 
28801 GATTTTAAGG AGCCAGCCTG TAATGTTACA TTCGCAGCTG AAGCTAATGA GTGCACCACT 
28861 CTTATAAAAT GCACCACAGA ACATGAAAAG CTGCTTATTC GCCACAAAAA CAAAATTGGC 
28921 AAGTATGCTG TTTATGCTAT TTGGCAGCCA GGTGACACTA CAGAGTATAA TGTTACAGTT 
28981 TTCCAGGGTA AAAGTCATAA AACTTTTATG TATACTTTTC CATTTTATGA AATGTGOGAC 
29041 ATXACCATGT ACATGAGCAA ACAGTATAAG TTGTGGCCCC CACAAAATTG TGTGGAAAAC 
29101 ACTGGCACTT TCTGCTGCAC TGCTATGCTA ATTACAGTGC TOGCTTTGGT CTGTACCCTA 
29161 CTCTATATTA AATACAAAAG CAGACGCAGC TTTATTGAGG AAAAGAAAAT GCCTTAATTT 
29221 ACTAAGTTAC AAAGCTAATG TCACCACTAA CTGCTTTACT CGCTGCTTGC AAAACAAATT 
29281 CAAAAAGTTA GCATTATAAT TAGAATAGGA TTTAAACCCC COGGTCATTT CCTGCTCAAT 
29341 ACCATTCCCC TGAACAATTG ACTCTATGTG GGATATGCTC CAGCGCTACA ACCTTGAAGT 
29401 CAGGCTTCCT GGATGTCAGC ATCTGACTTT GGCCAGCACC TGTCCCGCGG ATTTGTTCCA 
29461 GTCCAACTAC AGCGACCCAC CCTAACAGAG ATGACCAACA CAACCAACGC GGCOGCCGCT 
29521 ACCGGACTTA CATCTACCAC AAATACACCC CAAGTTTCTG CCTTTGTCAA TAACTGGGAT 
29581 AACTTGGGCA TGTGQTGQTT CTCCATAGCG CTTATGTTTG TATGCCTTAT TATTATGTGG 
29641 CTCATCTGCT GCCTAAAGCG CAAACGCGCC CGACCACCCA TCTATAGTCC CATCATTGTG 
29701 CTACACCCAA ACAATGATGG AATCCATAGA TTGGACGGAC TGAAACACAT GTTCTTTTCT 
29761 CTTACAGTAT GATTAAATGA GACATGATTC CTCGAGTTTT TATATTACTG ACCCTTGTTO 
29821 CGCTTTTTTG TGOGTGCTCC ACATTGGCTG CGGTTTCTCA CATCGAAGTA GACTGCATTC 
29881 CAGCCTTCAC AGTCTATTTG CTTTAOGGAT TTGTCACCCT CACGCTCATC TGCAGCCTCA 
29941 TCACTGTGGT CATCGCCTTT ATCCAGTGCA TTGACTGGGT CTGTGTGOGC TTTGCATATC 
30001 TCAGACACCA TCCCCAOTAC AGGGACAGGA CTATAGCTGA GCTTCTTAGA ATTCTTTAAT 
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30061 TATGAAATTT ACTGTGACTT TTCTQCTGAT TATTTGCACC CTATCTGCGT TTTGTTCCCC 
30121 GACCTCCAAG CCTCAAAGAC ATATATCATG CAGATTCACT CGTATATGGA ATATTCCAAG 
30181 TTGCTACAAT GAAAAAAOCG ATCTTTCCGA AGCCTGGTTA TATGCAATCA TCTCT GT TAT 
30241 GOTGTTCTGC AGTACCATCT TAGCCCTAGC TATATATCCC TACCTTGACA TTGGCTGGAA 
30301 ACGAATAGAT GCCATGAACC ACCCAACTTT CCCCOCGCCC GCTATGCTTC CACTGCAACA 
30361 AGTTGTTGCC GGCGGCTTTG TCCCAGCCAA TCAGCCTCGC CCCACTTCTC CCACCCCCAC 
30421 TGAAATCAGC TACTTTAATC TAACAGGAGG AGATGACTGA CACCCTAGAT CTAGAAATGG 
30481 ACGGAATTAT TACAGAGCAG CGCCTGCTAG AAAGACGCAG GGCAGCGGCC GAGCAACAGC 
30541 GCATGAATCA AGAGCTCCAA GACATGGTTA ACTTGCACCA GTGCAAAAGG GGTATCTTTT 
30601 GTCTGGTAAA GCAGGCCAAA GTCACCTACG ACAGTAATAC CACCGGACAC CGCCTTAGCT 
30661 ACAAGTTGCC AACCAAGCGT CAGAAATTGG TGGTCATGGT GGGAGAAAAG CCCATTACCA 
30721 TAACTCAGCA CTCGGTAGAA ACCGAAGGCT GCATTCACTC ACCTTGTCAA GGACCTGAGG 
30781 ATCTCTGCAC CCTTATTAAG ACCCTGTGCG GTCTCAAAGA TCTTATTCCC TTTAACTAAT 
30841 AAAAAAAAAT AATAAAGCAT CACTTACTTA AAATCAGTTA GCAAATTTCT GTCCAGTTTA 
30901 TTCAGCAGCA CCTCCTTGCC CTCCTCCCAG CTCTGGTATT GCAGCTTCCT CCTGGCTGCA 
30961 AACTTTCTCC ACAATCTAAA TGGAATGTCA GTTTCCTCCT GTTCCTGTCC ATCCGCACCC 
31021 ACTATCTTCA TGTTGTTGCA GATGAAGCGC GCAAGACCGT CTGAAGATAC CTTCAACCCC 
31081 GTGTATCCAT ATGACACGGA AACCGGTCCT CCAACTGTGC CTTTTCTTAC TCCTCCCTTT 
31141 GTATCCCCCA ATGGGTTTCA AGAGAGTCCC CCTGGGGTAC TCTCTTTGCG CCTATCCGAA 
31201 CCTCTAGTTA CCTCCAATGG CATGCTTGCG CTCAAAATGG GCAACGGCCT CTCTCTGGAC 
31261 GAGGCCGGCA ACCTTACCTC CCAAAATGTA ACCACTGTGA GCCCACCTCT CAAAAAAACC 
31321 AAGTCAAACA TAAACCTGGA AATATCTGCA CCCCTCACAG TTACCTCAGA AGCCCTAACT 
31381 GTGGCTGCCG CCGCACCTCT AATGGTCGCG GGCAACACAC TCACCATGCA ATCACAGGCC 
31441 CCGCTAACCG TGCACGACTC CAAACTTAGC ATTGCCACCC AAGGACCCCT CACAGTGTCA 
31501 GAAGGAAAGC TAGCCCTGCA AACATCAGGC CCCCTCACCA CCACCGATAG CAGTACCCTT 
31561 ACTATCACTG CCTCACCCCC TCTAACTACT GCCACTGGTA GCTTGGGCAT TGACTTGAAA 
31621 GAGCCCATTT ATACACAAAA TGGAAAACTA GGACTAAAGT ACGGGGCTCC TTTGCATGTA 
31681 ACAGACGACC TAAACACTTT GACCGTAGCA ACTGGTCCAG GTGTGACTAT TAATAATACT 
31741 TCCTTGCAAA CTAAAGTTAC TGGAGCCTTG GGTTTTGATT CACAAGGCAA TATGCAACTT 
31601 AATGTAGCAG GAGGACTAAG GATTGATTCT CAAAACAGAC GCCTTATACT TGATGTTAGT 
31861 TATCCGTTTG ATGCTCAAAA CCAACTAAAT CTAAGACTAG GACAGGGCCC TCTTTTTATA 
31921 AACTCAGCCC ACAACTTGGA TATTAACTAC AACAAAGGCC TTTACTTGTT TACAGCTTCA 
31981 AACAATTCCA AAAAGCTTGA GGTTAACCTA AGCACTGCCA AGGGGTTGAT GTTTGACGCT 
32041 ACAGCCATAG CCATTAATGC AGGAGATGGG CTTGAATTTG GTTCACCTAA TGCACCAAAC 
32101 ACAAATCCCC TCAAAACAAA AATTGGCCAT GGCCTAOAAT TTGATTCAAA CAAGGCTATG 
32161 GTTCCTAAAC TAGGAACTGG CCTTAGTTTT GACAGCACAG GTGCCATTAC AQTAGGAAAC 
32221 AAAAATAATG ATAAGCTAAC TTTGTGGACC ACACCAGCTC CATCTCCTAA CTGTAGACTA 
32281 AATGCAGAGA AAGATGCTAA ACTCACTTTG GTCTTAACAA AATGTGGCAG TCAAATACTT 
32341 GCTACAGTTT CAGTTTTGGC TGTTAAAGGC AGTTTGGCTC CAATATCTGG AACAGTTCAA 
32401 AGTGCTCATC TTATTATAAG ATTTGACGAA AATGGAGTGC TACTAAACAA TTCCTTCCTG 
32461 GACCCAGAAT ATTGGAACTT TAGAAATGGA GATCTTACTG AAGGCACAGC CTATACAAAC 
32521 GCTGTTGGAT TTATGCCTAA CCTATCAGCT TATCCAAAAT CTCACGGTAA AACTGCCAAA 
32561 AGTAACATTG TCAGTCAAGT TTACTTAAAC GGAGACAAAA CTAAACCTGT AACACTAACC 
32641 ATTACACTAA ACGGTACACA GGAAACAGGA GACACAACTC CAAGTGCATA CTCTATGTCA 
32701 TTTTCATGGG ACTGGTCTGG CCACAACTAC ATTAATGAAA TATTTGCCAC ATCCTCTTAC 
32761 ACTTTTTCAT ACATTGCCCA AGAATAAAGA ATCGTTTGTG TTATGTTTCA ACGTGTTTAT 
32821 TTTTCAATTG CAGAAAATTT CAAGTCATTT TTCATTCAGT AGTATAGCCC CACCACCACA 
32861 TAGCTTATAC AGATCACCGT ACCTTAATCA AACTCACAGA ACCCTAGTAT TCAACCTGCC 
32941 ACCTCCCTCC CAACACACAG AGTACACAGT CCTTTCTCCC CGGCTGGCCT TAAAAAGCAT 
33001 CATATCATGG GTAACAGACA TATTCTTAGG TGTTATATTC CACACGGTTT CCTGTCGAGC 
33061 CAAACGCTCA TCAGTGATAT TAATAAACTC CCCGGGCAGC TCACTTAAGT TCATGTCGCT 
33121 GTCCAGCTGC TGAGCCACAG GCTGCTGTCC AACTTGCGGT TGCTTAACGG GCGGCGAAGG 
33181 AGAAGTCCAC GCCTACATGG GGGTAGAGTC ATAATCGTGC ATCAGGATAG GGOGGTGGTG 
33241 CTGCAGCAGC GCGCGAATAA ACTGCTGCCG CCGCCGCTCC GTCCTGCAGG AATACAACAT 
33301 GGCAGTGGTC TCCTCAGCGA TGATTCGCAC CGCCCGCAGC ATAAGGCGCC TTGTCCTCCG 
33361 GGCACAGCAG CGCACCCTGA TCTCACTTAA ATCAGCACAG TAACTGCAGC ACAGCACCAC 
33421 AATATTGTTC AAAATCCCAC AGTGCAAGGC GCTGTATCCA AAGCTCATGG CGGGGACCAC 
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33481 AGAACCCACG TGGCCATCAT ACCACAAGCG CAGOTAGATT AAGTGGCGAC CCCTCATAAA 
33541 CACGCTGGAC ATAAACATTA CCTCTTTTGG CATGTTGTAA TTCACCACCT CCCGGTACCA 
33601 TATAAACCTC TGATTAAACA TGGCGCCATC CACCACCATC CTAAACCAGC TGOCCAAAAC 
33661 CTGCCCGCCG GCTATACACT GCAGGGAACC GGGACTGGAA CAATGACAGT GGAGAGCCCA 
33721 GGACTCGTAA CCATGGATCA TCATGCTCGT CATOATATCA ATGTTGGCAC AACACAGGCA 
33781 CACGTGCATA CACTTCCTCA GGATTACAAG CTCCTCCCGC GTTAGAACCA TATCCCAGGG 
33 841 AACAACCCAT TCCTGAATCA GCGTAAATCC CACACTGCAG GGAAGACCTC GCACGTAACT 
33901 CACGTTGTGC ATTGTCAAAG TGTTACATTC GGGCAGCAGC GGATGATCCT CCAGTATGGT 
33961 AGCGCGGGTT TCTGTCTCAA AAGGAGGTAG ACGATCCCTA CTGTACGGAG TGCGCCGAGA 
34021 CAACCGAGAT CGTQTTGGTC GTAGTGTCAT GCCAAATGGA ACGCCGGACG TAGTCATATT 
34081 TCCTGAAGCA AAACCAGGTG CGGGCGTGAC AAACAGATCT GCGTCTCCGG TCTCGCCGCT 
34141 TAGATCGCTC TGTGTAGTAG TTGTAGTATA TCCACTCTCT CAAAGCATCC AGGCGCCCCC 
34201 TGGCTTCGGG TTCTATGTAA ACTCCTTCAT GCGCCGCTGC CCTGATAACA TCCACCACCG 
34261 CAGAATAAGC CACACCCAGC CAACCTACAC ATTCGTTCTG CGAGTCACAC ACGGGAGGAG 
34321 CGGOAAGAGC TGGAAGAACC ATGTTTTTTT TTTTATTCCA AAAGATTATC CAAAACCTCA 
34381 AAATGAAGAT CTATTAAGTG AACGCGCTCC CCTCCGGTGG CGTGGTCAAA CTCTACAGCC 
34441 AAAGAACAGA TAATGGCATT TGTAAGATGT TGCACAATGG CTTCCAAAAG GCAAACGGCC 
34501 CTCACGTCCA AGTGGACGTA AAGGCTAAAC CCTTCAGGGT G AATCTCC TC TATAAACATT 
34561 CCAGCACCTT CAACCATGCC CAAATAATTC TCATCTCGCC ACCTTCTCAA TATATCTCTA 
34621 AGCAAATCCC GAATATTAAG TCCGQCCATT GTAAAAATCT GCTCCAGAGC GCCCTCCACC 
34681 TTCAGCCTCA AGCAGCGAAT CATGATTGCA AAAATTCAGG TTCCTCACAG ACCTGTATAA 
34741 GATTCAAAAG CGGAACATTA ACAAAAATAC CGCGATCCCG TAGGTCCCTT CGCAGGGCCA 
34801 GCTGAACATA ATCGTGCAGG TCTGCAOGGA CCAGCGCGGC CACTTCCCCG CCAGOAACCT 
34861 TGACAAAAGA ACCCACACTG ATTATGACAC GCATACTCGG AGCTATGCTA ACCAGCGTAG 
34921 CCCCGATGTA AGCTTTGTTG CATGGGCGGC GATATAAAAT GCAAGGTGCT GCTCAAAAAA 
34981 TCAGGCAAAG CCTCGCGCAA AAAAGAAAGC ACATCGTAGT CATGCTCATG CAGATAAAGG 
35041 CAGGTAAGCT CCGGAACCAC CACAGAAAAA, GACACCATTT TTCTCTCAAA CATGTCTGCG 
35101 GGTTTCTGCA TAAACACAAA ATAAAATAAC* AAAAAAACAT TTAAACATTA GAAGCCTGTC 
35161 TTACAACAGG AAAAACAACC CTTAXAAGCA TAAGACGGAC TACGGCCATG CGGGCGTGAC 
35221 CGTAAAAAAA CTGGTCACCG TGATTAAAAA GCACCACCGA CAGCTCCTCG GTCATGTCCG 
35281 GAGTCATAAT GTAAGACTCG GTAAACACAT CAGGTTGATT CATCGGTCAG TGCTAAAAAG 
35341 CGACCGAAAT AGCCCGGGGG AATACATACC CGCAGGCGTA GAGACAACAT TACAGCCCCC 
35401 ATAGGAGGTA TAACAAAATT AATAGGAGAG AAAAACACAT AAACACCTGA AAAACCCTCC 
35461 TGCCTAGGCA AAATAGCACC CTCCCGCTCC AGAACAACAT ACAGCGCTTC ACAGCGGCAG 
35521 CCTAACAGTC AGCCTTACCA GTAAAAAAGA AAACCTATTA AAAAAACACC ACTCGACACG 
35581 GCACCAGCTC AATCAGTCAC AGTGTAAAAA AGGGCCAAGT GCAGAGCGAG TATATATAGG 
35641 ACTAAAAAAT GACGTAACGG TTAAAGTCCA CAAAAAACAC CCAGAAAACC GCACGCGAAC 
35701 CTACGCCCAG AAACGAAAGC CAAAAAACCC ACAACTTCCT CAAATCGTCA CTTCCGTTTT 
35761 CCCACGTTAC GTAACTTCCC ATTTTAAGAA AACTACAATT CCCAACACAT ACAAGTTACT 
35821 CCGCCCTAAA ACCTACGTCA CCCGCCCCGT TCCCACGCCC CGCGCCACGT CACAAACTCC 
35881 ACCCCCTCAT TATCATATTG GCTTCAATCC AAAATAAGGT ATATTATTGA TGATG 
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UOOJS KD1 33592 bp DNA SYN 2 8 -APR- 1999 

DEFINITION KD1 
ACCESSION KD1 
KEYWORDS 

SOURCE Unknown . 

ORGANISM Unknown 

Unclassified. 
REFERENCE 1 (bases 1 to 33592) 

AUTHORS Self 
. JOURNAL Unpublished. 
FEATURES Location/Qualifiers 
CDS 1.. 33592 

/gene="KDl" 
/product«°KDl" 
BASE COUNT 7744 a 9470 C 9285 g 7093 t 

ORIGIN 

1 CATCATCAAT AATATACCTT ATTTTGGATT GAAGCCAATA TGATAATGAG GGGGTGGAGT 
61 TTGTGACGTG GCGCGGGGCG TGGGAACGGG GCGGGTGACG TAGTAGTGTG GCG GAAGTGT 
121 GATGTTGCAA GTGTGGCGGA ACACATGTAA GCGACGGATG TGGCAAAAGT GACGTTTTTG 
181 GTGTGCGCCG GTGTACACAG GAAGTGACAA TTTTCGCGCG GTTTTAGGCG GATGTTGTAG 
241 TAAATTTGGG CGTAACCGAG TAAGATTTGG CCATTTTCGC GGGAAAACTG AATAAGAGGA 
301 AGTGAAATCT GAATAATTTT GTGTTACTCA TAGCGCGTAA TATTT GTCTA GGGCCGCGGG 
361 GACTTTGACC GTTTACGTGG AGACTCGCCC AGGTGTTTTT CTCAGGTGTT TTCCGCGTTC 
421 CGGGTCAAAG TTGGCGTTTT ATTATTATAG TCAGCTGACG TGTAGTGTAT TTATACCCGG 
481 TGAGTTCCTC AAGAGGCCAC TCTTGAGTGC CAGCGAGTAG AGTTTTCTCC TCCGAGCCGC 
541 TCCGACACCG GGACTGAAAA TGAGACATGA GGTACTGGCT GATAATCTTC CACCTCCTAG 
601 CCATTTTGAA CCACCTACCC TTCACGAACT GTATGATTTA GACGTGACGG CCCCCGAAGA 
661 TCCCAACGAG GAGGCGCTTT CGCAGATTTT TCCCGACTCT GTAATGTTGG CGGTGCAGGA 
721 AGGGATTGAC TTACTCACTT TTCCGCCGGC GCCCGGTTCT CCGGAGCCGC CTCACCTTTC 
781 CCGGCAGCCC GAGCAGCCGG AGCAGAGAGC CTTGGGTCCG GTTTGCCACG AGGCTGGCTT 
841 TCCACCCAGT GACGACGAGG ATGAAGAGGG TGAGGAGTTT GTGTTAGATT ATGTGGAGCA 
901 CCCCGGGCAC GGTTGCAGGT CTTGTCATTA TCACCGGAGQ AATACGGGGG ACCCAGATAT 
961 TATGTGTTCG CTTTGCTATA TGAGGACCTG TGGCATGTTT GTCTACAGTA AGTGAAAATT 
1021 ATGGGCAGTG GGTGATAGAG TGGTGGGTTT GGTGTGGTAA TTTTTTTTTT AATTTTTACA 
1081 GTTTTGTGGT TTAAAGAATT TTGTATTGTG ATTTTTTTAA AAGGTCCTGT GTCTOAACCT 
1141 GAGCCTGAGC CCGAGCCAGA ACOGGAGCCT GCAAGACCTA CCCGCCGTCC TAAAATGGCG 
1201 CCTGCTATCC TGAGACGCCC GACATCACCT GTGTCTAGAG AATGCAATAG TAGTACGGAT 
1261 AGCTGTGACT CCGGTCCTTC TAACACACCT CCTGAGATAC ACC CGGTG GT CCCGCTGTGC 
1321 CCCATTAAAC CAGTTGCCGT GAGAGTTGGT GGGCGTCGCC AGGCTGTGGA ATGTATCGAG 
1381 GACTTGCTTA ACGAGCCTGG GCAACCTTTG GACTTGAGCT GTAAACGCCC CAGGCCATAA 
1441 GGTGTAAACC TGTGATTGCG TGTGTGGTTA ACGCCTTTGT TTGCTGAATG AGTTGATGTA 
1501 AGTTTAATAA AGGGTGAGAT AATGTTTAAC TTGCATGGCG TGTTAAATGG GGOGGGGCTT 
1561 AAAGGGTATA TAATGCGCCG TGGGCTAATC TTGGTTACAT CTGACCTCAT GGAGGCTTGG 
1621 GAGTCTTTGG AAGATTTTTC TGCTGTGOGT AACTTGCTGG AACAGAGCTC TAACAGTACC 
1681 ItrriW lT I T GGAGGTTTCT GTGGGGCTCA TCCCAGGCAA AGTTAGTCTG CAGAA TTAAG 
1741 GAGGATTACA AGTGGGAATT TGAAGAGCTT TTGAAATCCT GTGGTG AGCT GTTTOATTCT 
1801 TTGAATCTGG GTCACCAGGC GCTTTTCCAA GAGAAGGTCA TCAAGACTTT GGATTTTTCC 
1861 ACACCGGGGC GCGCTGCGGC TGCTGTTGCT TTTTTQAQTT TTATAAAGGA TAAATGGAGC 
1921 GAAOAAACCC ATCTGAGCGG GGGGTACCTG CTGGATTTTC TGGCCATGCA TCTGTGGAGA 
1981 GCGGTTGTGA GACACAAGAA TCGCCTGCTA CTGTTGTCTT CCGTCCGCCC GGCGATAATA 
2041 CCGACGGAGG AGCAGCAGCA GCAGCAGGAG GAAGCCAGGC GGCGGCGGCA GGAGCAGAGC 
2101 CCATGGAACC COAOAGCCGG CCTGOACCCT CGGGAATGAA TGTTGTACAG GTGGCTGAAC 
2161 TGTATCCAGA ACTGAGACGC ATTTTGACAA TTACAGAGGA TGGGCAGGGG CTAAAGGGQQ 
2221 TAAAGAGGGA GCGGGGGGCT TGTGAGGCTA CAGAGGAGGC TAGGAATCTA GCTTTTAGCT 
2281 TAATGACCAG ACACCGTCCT GAGTGTATTA CTTTTCAACA GATCAAGGAT AATTGCGCTA 
2341 ATGAGCTTGA TCTGCTGGCG CAGAAGTATT CCATAGAGCA GCTGACCACT TACTGGCTGC 
2401 AGCCAGGGOA TGATTTTGAG GAGGCTATTA GGGTATATGC AAAGGTGGCA CTTAGGCCAG 
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2461 ATTGCAAGTA CAAGATCAGC AAACTTGTAA ATATCAGGAA TTGTTGCTAC AT TTCTGGGA 
2521 ACGGGGCCGA GGTGGAGATA GATACGGAGG ATAGGGTGGC CTTTAOATGT AGCATGATAA 
2581 ATATGTGQCC GGGGGTGCTT GGCATGGACG GGGTGGTTAT TATGAATGTA AGGTTTACTG 
2641 GCCCCAATTT TAGCGGTACG GTTTTCCTGG CCAATACCAA CCTTATCCTA CACGGTGTAA 
2701 GCTTCTATGG GTTTAACAAT ACCTGTGTGG AAGCCTGGAC CGATGTAAGG GTTCX3GGGCT 
2761 GTGCCTTTTA CTGCTGCTGG AAGGGGGTGG TGTGTCGCCC CAAAAGCAGG GCTTCAATTA 
2821 AGAAATGCCT CTTTGAAAGG TGTACCTTGG GTATCCTGTC TGAGGGTAAC TCCAGGGTGC 
2881 GCCACAATGT GGCCTCCGAC TGTGGTTGCT TCATGCTAGT GAAAAGCGTG GCTGTGATTA 
2941 AGCATAACAT GGTATGTGGC AACTGCGAGG ACAGGGCCTC TCAGATGCTG ACCTGCTCGG 
3001 ACGGCAACTG TCACCTGCTG AAGACCATTC ACGTAGCCAG CCACTCTCGC AAGGCCTGGC 
3061 CAGTGTTTGA GCATAACATA CTGACCCGCT GTTCCTTGCA TTTGGGTAAC AGGAGGGGGG 
3121 TGTTCCTACC TTACCAATGC AATTTGAGTC ACACTAAGAT ATTGCTTGAG CCCGAGAGCA 
3181 TGTCCAAGGT GAACCTGAAC GGGGTGTTTG ACATGACCAT GAAGATCTGG AAGGTGCTGA 
3241 GGTACGATGA GACCCGCACC AGGTGCAGAC CCTGCGAGTG TGGCGGTAAA CATATTAGGA 
3301 ACCAGCCTGT GATGCTGGAT GTGACCGAGG AGCTGAGGCC CGATCACTTG GTGCTGGCCT 
3361 GCACCCGCGC TGAGTTTGGC TCTAGCGATG AAGATACAGA TTGAGGTACT GAAATGTGTG 
3421 GGCGTGGCTT AAGGGTGGGA AAGAATATAT AAGGTGGGGG TCTTATGTAG TTTTGTATCT 
3481 GTTTTGCAGC AGCCGCCGCC GCCATGAGCA CCAACTCGTT TGATGGAAGC ATTGTGAGCT 
3541 CATATTTGAC AACGCGCATG CCCCCATGGG CCGGGGTGCG TCAGAATGTG ATGGGCTCCA 
3601 GCATTGATGG TCGCCCOGTC CTGCCCGCAA ACTCTACTAC CTTGACCTAC GAGACCGTGT 
3661 CTGGAACGCC GTTGGAGACT GCAGCCTCCG CCGCCGCTTC AGCCGCTGCA GCCACCGCCC 
3721 GCGGGATTGT GACTGACTTT GCTTTCCTGA GCCCGCTTGC AAGCAGTGCA GCTTCCCGTT 
3781 CATCCGCCCG CGATGACAAG TTGACGGCTC TTTTGGCACA ATTGGATTCT TTGACCCGGG 
3841 AACTTAATGT CGTTTCTCAG CAGCTGTTGG ATCTGCGCCA GCAGGTTTCT GCCCTGAAGG 
3901 CTTCCTCCCC TCCCAATGCG GTTTAAAACA TAAATAAAAA ACCAGACTCT GTTTGGATTT 
3961 GGATCAAGCA AGTGTCTTGC TGTCTTTATT TAGGGGTTTT GCGCGCGCGG TAGGCCCGGG 
4021 ACCAGCGGTC TCGGTCGTTG AGGGTCCTGT GTATTTTTTC CAGGACGTGG TAAAGGTGAC 
4081 TCTGGATGTT CAGATACATG GGCATAAGCC CGTCTCTGGG GTGGAGGTAG CACCACTGCA 
4141 GAGCTTCATG CTGCGGGGTG GTGTTGTAGA TGATCCAGTC GTAGCAGGAG CGCTGGGCGT 
4201 GGTGCCTAAA AATGTCTTTC AGTAGCAAGC TGATTGCCAG GGGCAGGCCC TTGGTGTAAG 
4261 TGTTTACAAA GCGGTTAAGC TGGGATGGGT GCATACGTGG GGATATGAGA TGCATCTTGG 
4321 ACTGTATTTT TAGGTTGGCT ATGTTCCCAG CCATATCCCT CCGGGGATTC ATGTTGTGCA 
4381 GAACCACCAG CACAGTGTAT CCGGTGCACT TGGGAAATTT GTCATGTAGC TTAGAAGGAA 
4441 ATGCGTGGAA GAACTTGGAG ACGCCCTTGT GACCTCCAAG ATTTTCCATG CATTCGTCCA 
4501 TAATGATGGC AATGGGCCCA CGGGCGGCGG CCTGGGCGAA GATATTTCTG GGATCACTAA 
4561 CGTCATAGTT GTGTTCCAGG ATGAGATCGT CATAGGCCAT TTTTACAAAG CGCGGGCGGA 
4621 GGGTGCCAGA CTGCGGTATA ATGGTTCCAT CCGGCCCAGG GGCGTAGTTA CCCTCACAGA 
4681 TTTGCATTTC CCACGCTTTG AGTTCAGATG GGGGGATCAT GTCTACCTGC GGGGCGATGA 
4741 AGAAAACGGT TTCCGGGGTA GGGGAGATCA GCTGGGAAGA AAGCAGGTTC CTGAGCAGCT 
4801 GCOACTTACC GCAGCCGGTG GGCCCGTAAA TCACACCTAT TACCGGGTGC AACTGGTAGT 
4861 TAAGAGAGCT GCAGCTGCCG TCATCCCTGA GCAGGGGGGC CACTTCGTTA AGCATGTCCC 
4921 TGACTCGCAT GTTTTCCCTG ACCAAATCCG CCAGAAGGCG CTCGCCGCCC AGCGATAGCA 
4981 GTTCTTGCAA GGAAGCAAAG TTTTTCAAOG GTTTGAGACC GTCCGCCGTA GGCATGCTTT 
5041 TGAGCGTTTG ACCAAGCAGT TCCAGGCGGT CCCACAGCTC GGTCACCTGC TCTAOGGCAT 
5101 CTCGATCCAG CATATCTCCT CGTTTCGCGG GTTGGGGCGG CTTTCGCTGT ACGGCAGTAG 
5161 TCGGTGCTCG TCCAGACGGG CCAGGGTCAT GTCTTTCCAC GGGCGCAGGG TCCTCGTCAG 
5221 CGTAGTCTGG GTCACGGTGA AGGGGTGCGC TCCGGGCTGC GCGCTGGCCA GGOTGCGCTT 
5281 GAGGCTGGTC CTGCTGGTGC TGAAGCGCTG CCGGTCTTCG CCCTGCGCGT CGGCCAGGTA 
5341 GCATTTGACC ATGGTGTCAT AGTCCAGCCC CTCCGCGGCG TGGCCCTTGG CGCGCAGCTT 
5401 GCCCTTGGAG GAGGCGCCGC ACGAGGGGCA GTGCAGACTT TTGAGGGCGT AGAGCTTGGG 
5461 CGCOAGAAAT ACCGATTCCG GGGAGTAGGC ATCCGCGCCG CAGGCCCCGC AGACGGTCTC 
5521 GCATTCCACG AGCCAGGTGA GCTCTGGCCG TTOGGGGTCA AAAACCAGGT TTCCCCCATG 
5581 CTTTTTGATG CGTTTCTTAC CTCTGGTTTC CATGAGCCGG TGTCCAOGCT CGGTGACGAA 
5641 AAGGCTGTCC GTGTCCCCGT ATACAGACTT GAGAGGCCTG TCCTCGAGCG GTOTTCCGCG 
5701 GTCCTCCTCG TATAGAAACT CGGACCACTC TGAGACAAAG GCTCGCGTCC AGGCCAGCAC 
5761 GAAGGAGGCT AAGTGGGAGG GGTAGCGGTC GTTGTCCACT AGGGGGTCCA CTCGCTCCAG 
5821 GGTGTGAAGA CACATGTCGC CCTCTTCGGC ATCAAGGAAG GTGATTGGTT TGTAGGTGTA 
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5881 GGCCACGTGA CCGGGTGTTC CTGAAGGGGG GCTATAAAAG GGGGTGGGGG CGCGTTCGTC 
5941 CTCACTCTCT TCCGCATCGC TGTCTGCGAG GGCCAGCTGT TGGGGTGAGT ACTCCCTCTG 
6001 AAAAGCGGGC ATGACTTCTG CGCTAAGATT GTCAGTTTCC AAAAACGAGG AGGATTTGAT 
6061 ATTCACCTGG CCCGCGGTGA TGCCTTTGAG GGTGGCCGCA TCCATCTGGT CAGAAAAGAC 
6121 AATCTTTTTG TTGTCAAGCT TGGTGGCAAA CGACCCGTAG AGGGCGTTGG ACAGCAACTT 
6181 GGCGATGGAG CGCAGGGTTT GGTTTTTGTC GCGATCGGCG CGCTCCTTGG CCGCGATGTT 
6241 TAGCTGCACG TATTCGCGCG CAACGCACCG CCATTOGGGA AAGAOGGTGG TGCGCTCGTC 
6301 GGGCACCAGG TGCACGCGCC AACCGCGGTT GTGCAGGGTG ACAAGGTCAA CGCTGGTGGC 
6361 TACCTCTCCG CGTAGGCGCT CGTTGGTCCA GCAGAGGCGG CCGCCCTTGC GCGAGCAGAA 
6421 TGGCGGTAGG GGGTCTAGCT GCGTCTCGTC CGGGGGGTCT GCGTCCACGG TAAAGACCCC 
6481 GGGCAGCAGG CGCGCGTCGA AGTAGTCTAT CTTGCATCCT TGCAAGTCTA GCGCCTGCTG 
6S41 CCATGCGCGG GCGGCAAGCG CGCGCTCGTA TGGGTTGAGT GGGGGACCCC ATGGCATGGG 
6601 GTGGGTGAGC GCGGAGGCGT ACATGCCGCA AATGTCGTAA ACGTAGAGGO GCTCTCTGAG 
6661 TATTCCAAGA TATGTAGGGT AGCATCTTCC ACCGCGGATG CTGGCGCGCA CGTAATCGTA 
6721 TAGTTCGTGC GAGGGAGCGA GGAGGTCGGG ACCGAGGTTG CTACGGGOGG GCTGCTCTGC 
6781 TCGGAAGACT ATCTGCCTGA AGATGGCATG TGAGTTGGAT GATATGGTTG GACGCTGGAA 
6841 GACGTTGAAG CTGGCGTCTG TGAGACCTAC CGCGTCACGC ACGAAGOAGG CGTAGGAGTC 
6901 GCGCAGCTTG TTGACCAGCT CGGCGGTGAC CTGCACGTCT AGGGOGCAGT AGTCGAGGGT 
6961 TTCCTTGATG ATGTCATACT TATCCTGTCC CTTTTTTTTC CACAGCTCGC GGTTGAGGAC 
7021 AAACTCTTCG CGGTCTTTCC AGTACTCTTG GATCGGAAAC CCGTOGGCCT CCGAACGGTA 
7081 AGAGCCTAGC ATGTAGAACT GGTTGACGGC CTGGTAGGCG CAGCATCCCT TTTCTACGGG 
7141 TAGCGCGTAT GCCTGCGCGG CCTTCCGGAG CGAGGTGTGG GTGAGCGCAA AGGTGTCCCT 
7201 GACCATGACT TTGAGGTACT GGTATTTGAA GTCAGTGTCG TCGCATCCGC CCTGCTCCCA 
7261 GAGCAAAAAG TCCGTGCGCT TTTTGGAACG CGGATTTGGC AGGGCGAAGG TGACATCGTT 
7321 GAAGAGTATC TTTCCCGCGC GAGGCATAAA GTTGCGTGTG ATGCGGAAGG GTCCCGGCAC 
7381 CTCGGAACGG TTGTTAATTA CCTGGGCGGC GAGCACGATC TCGTCAAAGC CGTTGATGTT 
7441 GTGGCCCACA ATGTAAAGTT CCAAGAAGCG CGGGATGCCC TTGATGGAAG GCAATTTTTT 
7501 AAGTTCCTCG TAGGTGAGCT CTTCAGGGGA GCTGAGCCCG TGCTCTGAAA GGGCCCAGTC 
7561 TGCAAGATGA GGGTTGGAAG CGAOGAATGA GCTCCACAGG TCACGGGCCA TTAGCATTTG 
7621 CAGGTGGTCG CGAAAGGTCC TAAACTGGCG ACCTATGGCC ATTTTTTCTG GGGTGATGCA 
7681 GTAGAAGGTA AGCGGGTCTT GTTCCCAGCG GTCCCATCCA AGGTTCGCGG CTAGGTCTCG 
7741 CGCGGCAGTC ACTAGAGGCT CATCTCCGCC GAACTTCATG ACCAGCATGA AGGGCACGAG 
7801 CTGCTTCCCA AAGGCCCCCA TCCAAGTATA GGTCTCTACA TCGTAGGTGA CAAAGAGACG 
7861 CTCGGTGCGA GGATGCGAGC CGATCGGGAA GAACTGGATC TCCCGCCACC AATTGGAGGA 
7921 GTGGCTATTG ATGTGGTGAA AGTAGAAGTC CCTGCGACGG GCCGAACACT CGTGCTGGCT 
7981 TTTGTAAAAA CGTGCGCAGT ACTGGCAGCG GTGCACGGGC TGTACATCCT GCACGAGGTT 
8041 GACCTGACGA CCGCGCACAA GGAAGCAGAG TGGGAATTTG AGCCCCTOGC CTGGCGGGTT 
8101 TGGCTGGTGG TCTTCTACTT CGGCTGCTTG TCCTTGACCG TCTGGCTGCT CGAGGGGAGT 
8161 TACGGTGGAT CGGACCACCA CGCCGCGCGA GCCCAAAGTC CAGATGTCCG CGCGCGGCGG 
8221 TCGGAGCTTG ATGACAACAT CGCGCAGATG GGAGCTGTCC ATGGTCTGGA GCTCCOGCGG 
8281 CGTCAGGTCA GGCGGGAGCT CCTGCAGGTT TACCTCGCAT AGACGGGTCA GGGCGOGGGC 
8341 TAGATCCAGG TGATACCTAA TTTCCAGGGG CTGGTTGGTG GCGGCGTCGA TGGCTTGCAA 
8401 GAGGCCGCAT CCCCGCGGCG CGACTACGGT ACCGCGCGGC GGGCGGTGGG CCGCGGGGGT 
8461 GTCCTTGGAT GATGCATCTA AAAGCGGTGA CGCGGGCGAG CCCCOGGAGG TAGGGGGGOC 
8521 TCCGGACCCG CCGGGAGAGG GGGCAGGGGC ACGTCGGCGC CGCGCGCGGG CAGGAGCTGG 
8581 TGCTGCGCGC GTAGGTTGCT GGCGAACGCG ACGACGCGGC GGTTGATCTC CTGAATCTGG 
8641 CGCCTCTGCG TGAAGACGAC GGGCCCGGTG AGCTTGAGCC TGAAAGAGAG TTOGACAGAA 
8701 TCAATTTCGG TGTCGTTGAC GGCGGCCTGG CGCAAAATCT CCTGCACGTC TCCTGAGTTG 
8761 TCTTGATAGG CGATCTCGGC CATGAACTGC TOOATCTCTT CCTCCTGGAG ATCTCOGCGT 
8821 CCGGCTCGCT CCACGGTGGC GGCGAGGTCG TTGGAAATGC GGGCCATGAG CTGCGAGAAG 
8881 G CGTTG AGGC CTCCCTCGTT CCAGACGCGG CTGTAGACCA CGCCCCCTTC GGCATCGCGG 
8941 GCGCGCATGA CCACCTGCGC GAGATTGAGC TCCACGTGCC GGGCGAAGAC GGCGTAGTTT 
9001 CGCAGGOGCT GAAAGAGGTA GTTGAGGGTG GTGGCGGTGT GTTCTGCCAC GAAGAAGTAC 
9061 ATAACCCAGC GTCGCAACGT GGATTCGTTG ATATCCCCCA AGGCCTCAAO GCGCTCCATG 
9121 GCCTCOTAGA AGTCCAOGGC GAAGTTGAAA AACTGGOAGT TGCGCGCCGA CAOGGTTAAC 
9181 TCCTCCTCCA GAAGACGGAT GAGCTOGGCG ACAGTGTCGC GCACCTCGCG CTCAAAGGCT 
9241 ACAGGGGCCT CTTCTTCTTC TTCAATCTCC TCTTCCATAA GGGCCTCCCC TTCTTCTTCT 
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9301 TCTGGCGGCG GTGGGGGAGG GGGGACACGG 
9361 ACAAAGCGCT CGATCATCTC CCCGCGGCGA 
9421 TTCTCGCGGG GGCGCAGTTG GAAGACGCCG 
9481 GGG CTGCCAT GCGGCAGGGA TACGGCGCTA 
9541 ACTCCGCCGC CGAGGGACCT GAOCOAGTCC 
9601 AA6GCGTCTA ACCAGTCACA GTCGCAAGGT 
9661 CGGCGGTCGG GGTTGTTTCT GGCGGAGGTG 
9721 TTGAGACGGC GGATGGTCGA CAGAAGCACC 
9781 AGGCGGTCGG CCATGCCCCA GGCTTCGTTT 
9841 TGCATGAGCC TTTCTACCGG CACTTCTTCT 
9901 TCTATCGCTG CGGCGGOGGC GGAGTTTGGC 
9961 GTGACCCCGA AGCCCCTCAT CGGCTGAAGC 
10021 AATATGGCCT GCTGCACCTG CGTGAGGGTA 
10081 TGGTATGCGC CCGTGTTGAT GGTGTAAGTG 
10141 TGGTGACCCG GCTGCGAGAG CTCGGTGTAC 
10201 ACGTAGTCGT TGCAAGTCCG CACCAGGTAC 
10261 TGGCGGTAGA GGGGCCAGCG TAGGGTGGCC 
10321 AGGCGATGAT ATCCGTAGAT GTACCTGGAC 
10381 GCGCGCGGAA AGTCGCGGAC GCGGTTCCAG 
10441 GTCGGGACGC TCTGGCCGGT CAGGCGCGCG 
10501 GAGCCTGTAA GCGGGCACTC TTCCGTGGTC 
10561 GGACGACCGG GGTTCGAGCC CCGTATCCGG 
10621 CGCGTGTCGA ACCCAGGTGT GCGACGTCAG 
10681 TCCAGGCGCG GCGGCTGCTG CGCTAGCTTT 
10741 GTTAGGCTGG AAAGCGAAAG CATTAAGTGG 
10801 CAAGGGTTGA GTCGCGGGAC CCCCGGTTCG 
10861 GGGGTTTGCC TCCCCGTCAT GCAAGACCCC 
10921 GCCCCTTTTT TGCTTTTCCC AGATGCATCC 
10981 GCAGCGGGAA GAGGAAGAGC AGCGGCAGAC 
11041 GTCAGGAGGG GCGACATCCG CGGTTGACGC 
11101 GCGCCGGGCC CGGCACTACC TGGACTTGGA 
11161 GCCCTCTCCT GAGCGGTACC CAAGGGTGCA 
11221 GCCGCGGCAG AACCTGTTTC GCGACCGCGA 
11281 AAAGTTCCAC GCAGGGCGCG AGCTGCGGCA 
11341 GGAGGACTTT GAGCCCGACG CGCGAACCGG 
11401 CGCCGACCTG GTAACCGCAT ACGAGCAGAC 
11461 CTTTAACAAC CACGTGCGTA CGCTTGTGGC 
11521 TCTGTGGGAC TTTGTAAGCG CGCTGGAGCA 
11581 GCTGTTCCTT ATAGTGCAGC ACAGCAGGGA 
11641 CATAGTAGAG CCCGAGGGCC GCTGGCTGCT 
11701 GGTGCAGGAG CGCAGCTTGA GCCTGGCTGA 
11761 TAGCCTGGGC AAGTTTTACG CCCGGAAGAT 
11821 GGAGGTAAAG ATCGAGGGGT TCTACATGCG 
11881 CGACCTGGGC GTTTATCGCA ACGAGCGCAT 
11941 CGAGCTCAGC GACCGCGAGC TGATGCACAG 
12001 CGGCGATAGA GAGGCCGAGT CCTACTTTGA 
12061 CCGACGCGCC CTGGAGGCAG CTGGGGCCGG 
12121 TGGCAACGTC GGO0QO 8 TGQ AGGAATATGA 
12181 CGAGTACTAA GCGGTGATGT TTCTGATCAG 
12241 CGGGCGGCGC TGCAGAGCCA GCCGTCCGGC 
12301 ATGGACCGCA TCATGTCGCT GACTGCGCGC 
12361 GCCAACCGGC TCTCCGCAAT TCTGGAAGCG 
12421 GAGAAGGTGC TGGCGATCGT AAACGCGCTG 
12481 GCCGGCCTGG TCTACGACGC GCTGCTTCAG 
12541 CAGACCAACC TGGACCGGCT GGTGGGGGAT 
12601 GCGCAGCAGC AGGGCAACCT GGGCTCCATG 
12661 CCCGCCAACG TGCCGCGGGG ACAGGAGGAC 
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CGGCGACGAC GGCGCACCGG GAGGCGGTCG 
CGGCGCATGG TCTCGGTGAC GGCGCGGCCG 
CCCGTCATGT CCCGGTTATG GGTTGGOGGG 
ACGATGCATC TCAACAATTG TTGTGTAGGT 
GCATCGACCG GATCGGAAAA CCTCTCGAGA 
AGGCTGAGCA CCGTGGCGGG CGGCAGCGGG 
CTGCTGATGA TGTAATTAAA GTAGGCGGTC 
ATGTCCTTGG GTCCGGCCTG CTGAATGCGC 
TGACATCGGC GCAGGTCTTT GTAGTAGTCT 
TCTCCTTCCT CTTGTCCTGC ATCTCTTGCA 
CGTAGGTGGC GCCCTCTTCC TCCCATGCGT 
AGGGCTAGGT CGGCGACAAC GCGCTCGGCT 
GACTGGAAGT CATCCATGTC CACAAAGCGG 
CAGTTGGCCA TAACGGACCA GTTAACGGTC 
CTGAGACGCG AGTAAGCCCT CGAGTCAAAT 
TGGTATCCCA CCAAAAAGTG CGGCGGOGGC 
GGGGCTCCGG GGGCGAGATC TTCCAACATA 
ATCCAGGTGA TGCCGGCGGC GGTGGTGGAG 
ATGTTGCGCA GCGGCAAAAA GTGCTCCATG 
CAATCGTTGA CGCTCTAGCG TGCAAAAGGA 
TGGTGGATAA ATTCGCAAGG GTATCATGGC 
CCGTCCGCCG TGATCCATGC GGTTACCGCC 
ACAACGGGGG AGTGCTCCTT TTGGCTTCCT 
TTTGGCCACT GGCCGCGCGC AGCGTAAGCG 
CTCGCTCCCT GTAGCCGGAG GGTTATTTTC 
AGTCTCGGAC CGGCCGGACT GCGGOGAACG 
GCTTGCAAAT TCCTCCGGAA ACAGGGACGA 
GGTGCTGCGG CAGATGCGCC CCCCTCCTCA 
ATGCAGGGCA CCCTCCCCTC CTCCXACCGC 
GGCAGCAGAT GGTGATTACG AACCCCCGCG 
GGAGGGCGAG GGCCTGGCGC GGCTAGGAGC 
GCTGAAGCGT GATACGCGTG AGGCGTACGT 
GGGAGAGGAG CCCGAGGAGA TGCGGGATCG 
TGGCCTGAAT CGCGAGCGGT TGCTGCGCGA 
GATTAGTCCC GCGCGCGCAC ACGTGGCGGC 
GGTGAACCAG G AGATTAACT TTCAAAAAAG 
GCGCGAGGAG GTGGCTATAG GACTGATGCA 
AAACCCAAAT AGCAAGCCGC TCATGGCGCA 
CAACGAGGCA TTCAGGGATG CGCTGCTAAA 
CGATTTGATA AACATCCTGC AGAGCATAGT 
CAAGGTGGCC GCCATCAACT ATTCCATGCT 
ATACCATACC CCTTACGTTC CCATAGACAA 
CATGGCGCTG AAGGTGCTTA CCTTGAGCGA 
CCACAAGGCC GTGAGCGTGA GCCGGCGGCG 
CCTGCAAAGG GCCCTGGCTG GCACGGGCAG 
CGCGGGCGCT GACCTGCGCT GGGCCCCAAG 
ACCTGGGCTG GCGGTGGCAC CCGCGCGCGC 
CGAGGACGAT GAGTACGAGC CAGAGGACGG 
ATGATGCAAG ACGCAACGGA CCCGGCGGTG 
CTTAACTCCA CGGACGACTG GCGCCAGOTC 
AATCCTGACG CGTTCCGGCA GCAGCCGCAG 
GTGGTCCCGG CGOGCGCAAA CCCCAOGCAC 
GCCGAAAACA GGGCCATCCG GCCCGACGAG 
CGCGTGGCTC GTTACAACAG CGGCAACGTG 
GTGCGCGAGG CCGTGGCGCA GCGTGAGCGC 
GTTGCACTAA ACGCCTTCCT GAGTACACAG 
TACACCAACT TTGTGAGCGC ACTGOGGCTA 
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12721 ATGGTGACTG AGACACCGCA AAGTGAGGTG TACCAGTCTG GGCCAGACTA TTTTTTCCAG 
12781 ACCAGTAGAC AAGGCCTGCA GACCGTAAAC CTGAGCCAGG CTTTCAAAAA CTTGCAGGGG 
12841 CTGTGGGGGG TGCGGGCTCC CACAGGCGAC C6CGCGACCG TGTCTAGCTT GCTGACGCCC 
12901 AACTCGCGCC TGTTGCTGCT GCTAATAGCG CCCTTCACGG ACAGTGGCAG CGTQTCCCGG 
12961 GACACATACC TAGGTCACTT GCTGACACTG TACCGCGAGG CCATAGOTCA GGCGCATGTG 
13021 GACGAGCATA CTTTCCAGGA GATTACAAGT GTCAGCCGCG CGCTGGGGCA GGAGGACACG 
13081 GGCAGCCTGG AGGCAACCCT AAACTACCTG CTGACCAACC GGCGGCAGAA GATCCCCTCG 
13141 TTGCACAGTT TAAACAGCGA GGAGGAGCGC ATTTTGCGCT ACGTGCAGCA' GAGCGTGAGC 
13201 CTTAACCTGA TGCGCGACGG GGTAACGCCC AGCGTGGCGC TGGACATGAC CGCGCGCAAC 
13261 ATGGAACCGG GCATGTATGC CTCAAACCGG CCGTTTATCA ACCGCCTAAT GGACTACTTG 
13321 CATCGCGCGG CCGCCGTGAA CCCCGAGTAT TTCACCAATG CCATCTTGAA CCCGCACTGG 
13381 CTACCGCCCC CTGGTTTCTA CACCGGGGGA TTCGAGGTGC CCGAGGGTAA CGATGGATTC 
13441 CTCTGGGACG ACATAGACGA CAGCGTGTTT TCCCCGCAAC CGCAGACCCT GCTAGAOTTG 
13501 CAACAGCGCG AGCAGGCAGA GGCGGCGCTG CGAAAGGAAA GCTTCCGCAG GCCAAGCAGC 
13561 TTGTCCGATC TAGGCGCTGC GGCCCCGCGG TCAGATGCTA GTAGCCCATT TCCAAGCTTG 
13621 ATAGGGTCTC TTACCAGCAC TCGCACCACC CGCCCGCGCC TGCTGGGCGA GGAGGAGTAC 
13681 CTAAACAACT CGCTGCTGCA GCCGCAGCGC GAAAAAAACC TGCCTCCGGC ATTTCCCAAC 
13741 AACGGGATAG AGAGCCTAGT GGACAAGATG AGTAGATGGA AGACGTACGC GCAGGAGCAC 
13801 AGGGACGTGC CAGGCCCGCG CCCGCCCACC CGTCGTCAAA GGCACOACCG TCAGCGGGGT 
13861 CTGGTGTGGG AGGACGATGA CTCGGCAGAC GACAGCAGCG TCCTGGATTT GGGAGGGAGT 
13921 GGCAACCCGT TTGCGCACCT TCGCCCCAGG CTGGGGAGAA TGTTTTAAAA AAAAAAAAGC 
13981 ATGATGCAAA ATAAAAAACT CACCAAGGCC ATGGCACCGA GCGTTGGTTT TCTTGTATTC 
14041 CCCTTAGTAT GCGGCGCGCG GCGATGTATG AGGAAGGTCC TCCTCCCTCC TACGAGAGTG 
14101 TGGTGAGCGC GGCGCCAGTG GCGGCGGCGC TGGGTTCTCC CTTCGATGCT CCCCTGGACC 
14161 CGCCGTTTGT GCCTCCGCGG TACCTGCGGC CTACCGGGGG GAGAAACAGC ATCCGTTACT 
14221 CTGAGTTGGC ACCCCTATTC GACACCACCC GTGTGTACCT GGTGGACAAC AAGTCAACGG 
14281 ATGTGGCATC CCTGAACTAC CAGAACGACC ACAGCAACTT TCTGACCACG GTCATTCAAA 
14341 ACAATGACTA CAGCCCGGGG GAGGCAAGCA CACAGACCAT CAATCTTGAC GACCGGTCGC 
14401 ACTGGGGCGG CGACCTGAAA ACCATCCTGC ATACCAACAT GCCAAATGTG AACGAGTTCA 
14461 TGTTTACCAA TAAGTTTAAG GCGCGGGTGA TGGTGTCGCG CTTGCCTACT AAGGACAATC 
14521 AGGTGGAGCT GAAATACGAG TGGGTGGAGT TCACGCTGCC CGAGGGCAAC TACTCCGAGA 
14581 CCATGACCAT AGACCTTATG AACAACGCGA TCGTGGAGCA CTACTTGAAA GTGGGCAOAC 
14641 AGAACGGGGT TCTGGAAAGC GACATCGGGG TAAAGTTTGA CACCCGCAAC TTCAGACTGG 
14701 GGTTTGACCC CGTCACTGGT CTTGTCATGC CTGGGGTATA TACAAACGAA GCCTTCCATC 
14761 CAGACATCAT TTTGCTGCCA GGATGCGGGG TGGACTTCAC CCACAGCCGC CTGAGCAACT 
14821 TGTTGGGCAT CCGCAAGCGG CAACCCTTCC AGGAGGGCTT TAGGATCACC TACGATGATC 
14881 TGGAGGGTGG TAACATTCCC GCACTGTTGG ATGTGGACGC CTACCAGGCG AGCTTGAAAG 
14941 ATGACACCGA ACAGGGCGGG GGTGGCGCAG GCGGCAGCAA CAGCAGTGGC AGCGGCGCGG 
15001 AAGAGAACTC CAACGCGGCA GCGGCGGCAA TGCAGCCGGT GGAGGACATG AACGATCATG 
15061 CCATTCGCGG CGACACCTTT GCGACACGGG CTGAGGAGAA GCGCGCTGAG GCOGAAGCAG 
15121 CGGCCGAAGC TGCCGCCCCC GCTGOGCAAC CCGAGGTCGA GAAGCCTCAG AAGAAACCGG 
15181 TGATCAAACC CCTGACAGAG GACAGCAAGA AACGCAGTTA CAACCTAATA AGCAATGACA 
15241 GCACCTTCAC CCAGTACOGC AGCTGGTACC TTGCATACAA CTACGGCGAC CCTCAGACCG 
15301 GAATCCGCTC ATGGACCCTG CTTTGCACTC CTGACGTAAC CTGCGGCTCG GAGCAGGTCT 
15361 ACTGGTCGTT GCCAGACATG ATGCAAGACC CCGTGACCTT CCGCTCCACG CGCCAGATCA 
15421 GCAACTTTCC GGTGGTGGGC GCCGAGCTGT TGCCCGTGCA CTCCAAGAGC TTCTACAACG 
15481 ACCAGGCGGT CTACTCCCAA CTCATCCGCC AGTTTACCTC TCTGACCCAC GTGTTCAATC 
15541 GCTTTCCCGA GAACCAGATT TTGGOGCGCC CGCCAGCCCC CACCATCACC ACCGTCAGTG 
15601 AAAACGTTCC TGCTCTCACA GATCACGGGA CGCTACCGCT GCGCAACAGC ATCGGAGGAG 
15661 TCCAGOGAOT GACCATTACT GAOGCCAGAC GCCGCACCTG CCCCTACOTT TACAAGGCCC 
15721 TGGGCATAGT CTCGCOGCGC GTCCTATCGA GCCGCACTTT TTGAGCAAGC ATGTCCATCC 
15781 TTATATOGCC CAOCAATAAC ACAGGCTGGG GCCTGCGCTT CCCAAGCAAG ATGTTTGGCG 
15841 GGGCCAAOAA GCGCTCCGAC CAACACCCAG TGCGCGTGCG CGGGCACTAC CGCGCGCCCT 
15901 GGGGCGOGCA CAAACGCGGC CGCACTGGGC GCACCACCGT CGATGACGCC ATCGACGCGG 
15961 TGGTGGAGGA GGOGCGCAAC TACACGCCCA CGCCGCCACC AGTGTCCACA GTGOACGCGG 
16021 CCATTCAGAC CGTGGTGCGC GGAGCCCGGC GCTATGCTAA AATGAAGAGA CGGCGGAGGC 
16081 GCGTAGCACG TCGCCACCGC CGCOGACCCG GCACTGCOGC CCAACGCGCO GCGGOGGCCC 
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16141 TGCTTAACCG CGCACGTCGC ACCGGCCGAC GGGCGGCCAT GCGGGCCGCT CGAAGGCTGG 
16201 CCGCGGGTAT TC5TCACTGTG CCCCCCAGGT CCAGGCGACG AGCGGCCGCC GCAGCA6CCQ 
16261 CGGCCATTAG TGCTATGACT CAGGGTCGCA GGGGCAACGT GTATTGGGTO CGCGACTCGG 
16321 TTAGCGGCCT GCGCGTGCCC GTGCGCACCC GCCCCCCGCG CAACTAGATT GCAAGAAAAA 
16381 ACTACTTAGA CTCGTACTGT TGTATGTATC CAGCGGCGGC GGCGCGCAAC GAAGCTATGT 
16441 CCAAGCGCAA AATCAAAGAA GAGATGCTCC AGGTCATCGC GCCGGAGATC TATGGCCCCC 
16501 CGAAGAAGGA AGAGCAGGAT TACAAGCCCC GAAAGCTAAA GCGGGTCAAA AAGAAAAAGA 
16561 AAGATGATGA TGATGAACTT GACGACGAGG TGGAACTGCT GCACGCTACC GCGCCCAGGC 
16621 GACGGGTACA GTGGAAAGGT CGACGCGTAA AACGTGTTTT GCGACCCGGC ACCACCGTAG 
16681 TCTTTACGCC CGGTGAGCGC TCCACCCGCA CCTACAAGCG CGTGTATGAT GAGGTGTACG 
16741 GCGACGAGGA CCTGCTTGAG CAGGCCAACG AGCGCCTCGG GGAGTTTGCC TACGGAAAGC 
16801 GGCATAAGGA CATGCTGGCG TTGCCGCTGG ACGAGGGCAA CCCAACACCT AGCCTAAAGC 
16861 CCGTAACACT GCAGCAGGTG CTGCCCGCGC TTGCACCGTC CGAAGAAAAG CGCGGCCTAA 
16921 AGCGCGAGTC TGGTGACTTG GCACCCACCG TGCAGCTGAT GGTACCCAAG CGCCAGCGAC 
16981 TGGAAGATGT CTTGGAAAAA ATGACCGTGG AACCTGGGCT GGAGCCCGAG GTCOGCOTGC 
17041 GGCCAATCAA GCAGGTGGCG CCGGGACTGG GCGTGCAGAC CGTGGACGTT CAGATACCCA 
17101 CTACCAGTAG CACCAGTATT GCCACOGCCA CAGAGGGCAT GGAGACACAA ACGTCCCOGG 
17161 TTGCCTCAGC GGTGGCGGAT GCCGCGGTGC AGGOGGTCGC TGCGGCCGCG TCCAAGACCT 
17221 CTACGGAGGT GCAAAOGGAC CCGTGGATGT TTCGCGTTTC AGCCCCCCGG CGCCCGCGCG 
17281 GTTCGAGGAA GTACGGOGCC GCCAGCGCGC TACTGCCCGA ATATGCCCTA CATCCTTCCA 
17341 TTGOGCCTAC CCCCGGCTAT CGTGGCTACA CCTACCGCCC CAGAAGACGA GCAACTACCC 
17401 GACGCCGAAC CACCACTGGA ACCCGCCGCC GCCGTCGCCG TCGCCAGCCC GTGCTGGCCC 
17461 CGATTTCCGT GCGCAGGGTG GCTCGCGAAG GAGGCAGGAC CCTGGTGCTG CCAACAGCGC 
17521 GCTACCACCC CAGCATCGTT TAAAAGCCGG TCTTTGTGGT TCTTGCAGAT ATGGCCCTCA 
17581 CCTGCCGCCT CCGTTTCCCG GTGCCGGGAT TCCGAGGAAG AATGCACCGT AGGAGGGGCA 
17641 TGGCCGGCCA CGGCCTGACG GGCGGCATGC GTCGTGCGCA CCACCGGCGG CGGCGCGCGT 
17701 CGCACCGTCG CATGCGCGGC GGTATCCTGC CCCTCCTTAT TCCACTGATC GCCGCGGCGA 
17761 TTGGCGCCGT GCCCGGAATT GCATCCGTGG CCTTGCAGGC GCAGAGACAC TGATTAAAAA 
17821 CAAGTTGCAT GTGGAAAAAT CAAAATAAAA AGTCTGGACT CTCACGCTCG CTTGGTCCTG 
17881 TAACTATTTT GTAGAATGGA AGACATCAAC TTTGCGTCTC TGGCCCCGOG ACACGGCTCG 
17941 CGCCCGTTCA TGGGAAACTG GCAAGATATC GGCACCAGCA ATATGAGCGG TGGCGCCTTC 
18001 AGCTGGGGCT CGCTGTGGAG CGGCATTAAA AATTTCGGTT CCACCGTTAA GAACTATGGC 
18061 AGCAAGGCCT GGAACAGCAG CACAGGCCAG ATGCTGAGGG ATAAGTTGAA AGAGCAAAAT 
18121 TTCCAACAAA AGGTGGTAGA TGGCCTGGCC TCTGGCATTA GCGGGGTGGT GGACCTGGCC 
18181 AACCAGGCAG TGCAAAATAA GATTAACAGT AAGCTTGATC CCCGCCCTCC CGTAGAGGAG 
18241 CCTCCACCGG CCGTGGAGAC AGTGTCTCCA GAGGGGCGTG GCGAAAAGCG TCCGCGCCCC 
18301 GACAGGGAAG AAACTCTGGT GACGCAAATA GACGAGCCTC CCTCGTACGA GGAGGCACTA 
18361 AAGCAAGGCC TGCCCACCAC CCGTCCCATC GCGCCCATGG CTACCGGAGT GCTGGGCCAG 
18421 CACACACCCG TAACGCTGGA CCTGCCTCCC CCCGCCGACA CCCAGCAGAA ACCTGTGCTG 
18481 CCAGGCCCGA COGCCGTTGT TGTAACCCGT CCTAGCCGCG CGTCCCTGCG COGCGCOGCC 
18541 AGCGGTCCGC GATCGTTGCG GCCCGTAGCC AGTGGCAACT GGCAAAGCAC ACTGAACAGC 
18601 ATCGTGGGTC TGGGGGTGCA ATCCCTGAAG CGCCGACGAT GCTTCTGAAT AGCTAACGTG 
18661 TCGTATGTGT GTCATGTATG CGTCCATGTC GCCGCCAGAG GAGCTGCTGA GCCGCCGCGC 
18721 GCCOGCTTTC CAAGATGGCT ACCCCTTCGA TGATGCCGCA GTGGTCTTAC ATGCACATCT 
18781 CGGGCCAGGA CGCCTCGGAG TACCTGAGCC CCGGGCTGGT GGAGTTTGCC CGCGCCACCG 
18841 AGACGTACTT CAGCCTGAAT AACAAGTTTA GAAACCCCAC GGTGGCGCCT ACGCACQAOG 
18901 TGACCACAGA CCGGTCCCAG CGTTTGACGC TGCGGTTCAT CCCTGTGGAC CGTGAGGATA 
18961 CTGCGTACTC GTACAAGGCG CGGTTCACCC TAGCTGTGGG TGATAACCGT GTGC TGGACA 
19021 TGGCTTCCAC GTACTTTGAC ATCCGCGGCG TGCTGGACAG GGGCCCTACT TTTAAGCCCT 
19081 ACTCTGGCAC TGCCTACAAC GCCCTGGCTC CCAAGGGTGC CCCAAATCCT TGCGAATGGG 
19141 ATGAAGCTGC TACTGCTCTT GAAATAAACC TAGAAGAAGA GGACGATGAC AACGAAGACG 
19201 AAGTAGACGA GCAAGCTGAG CAGCAAAAAA CTCACGTATT TGGGCAGGCG CCTTATTCTG 
19261 GTATAAATAT TACAAAGGAG GGTATTCAAA TAGGTGTCGA AGGTCAAACA CCTAAATATG 
19321 CCGATAAAAC ATTTCAACCT GAACCTCAAA TAGGAGAATC TCAGTGGTAC GAAACTGAAA 
19381 TTAATCATGC AGCTGGGAGA GTCCTTAAAA AGACTACCCC AATGAAACCA TGTTACGGTT 
19441 CATATGCAAA ACCCACAAAT GAAAATGGAG GGCAAGGCAT TCTTGTAAAG CAACAAAATG 
19501 GAAAGCTAGA AAGTCAAGTG GAAATGCAAT TTTTCTCAAC TACTGAGGCG ACCGCAGGCA 
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19561 ATGGTGATAA CTTGACTCCT AAAGTGGTAT TGTACAGTGA AGATQTAGAT ATAGAAACCC 
19621 CAGACACTCA TATTTCTTAC ATGCCCACTA TTAAGGAAGG TAACTCACGA GAACTAATGG 
19681 GCCAACAATC TATGCCCAAC AGGCCTAATT ACATTGCTTT TAGGGACAAT TTTATTGGTC 
19741 TAATOTATTA CAACAGCACG GGTAATATGG GTGTTCTGGC GGGCCAAGCA TCGCAGTTGA 
19801 ATGCTGTTGT AQATTTGCAA GACAGAAACA CAGAGCTTTC ATACCAGCTT TTGCTTGATT 
19861 CCATTGGTGA TAGAACCAGG TACTTTTCTA TGTGGAATCA GGCTGTTGAC AGCTATGATC 
19921 CAGATGTTAG AATTATTGAA AATCATGGAA CTGAAGATGA ACTTCCAAAT TACTGCTTTC 
19981 CACTGGGAGG TGTGATTAAT ACAGAGACTC TTACCAAGGT AAAACCTAAA ACAGGTCAGG 
20041 AAAATGGATG GGAAAAAGAT GCTACAGAAT TTTCAGATAA AAATGAAATA AGAGTTGGAA 
20101 ATAATTTTGC CATGGAAATC AATCTAAATG CCAACCTGTG GAGAAATTTC CTGTACTCCA 
20161 ACATAGCGCT GTATTTGCCC GACAAGCTAA AGTACAGTCC TTCCAACGTA AAAATTTCTG 
20221 ATAACCCAAA CACCTACGAC TACATGAACA AGCGAGTGGT GGCTCCCGGG TTAGTGGACT 
20281 GCTACATTAA CCTTGGAGCA CGCTGGTCCC TTGACTATAT GGACAACGTC AACCCATTTA 
20341 ACCACCACCG CAATGCTGGC CTGCGCTACC GCTCAATGTT GCTGGGCAAT GGTCGCTATG 
20401 TGCCCTTCCA CATCCAGGTG CCTCAGAAGT TCTTTGCCAT TAAAAACCTC CTTCTCCTGC 
20461 CGGGCTCATA CACCTACGAG TGGAACTTCA GGAAGGATGT TAACATGGTT CTGCAGAGCT 
20521 CCCTAGGAAA TGACCTAAGG GTTGACGGAG CCAGCATTAA GTTTGATAGC ATTTGCCTTT 
20581 ACGCCACCTT CTTCCCCATG GCCCACAACA CCGCCTCCAC GCTTGAGGCC ATGCTTAGAA 
20641 ACQACACCAA CGACCAGTCC TTTAACGACT ATCTCTCCGC CGCCAACATG CTCT ACCCT A 
20701 TACCCGCCAA CGCTACCAAC GTGCCCATAT CCATCCCCTC CCGCAACTGO GCGGCTTTCC 
20761 GCGGCTGGGC CTTCACGCGC CTTAAGACTA AGGAAACCCC ATCACTGGGC TCGGGCTACG 
20821 ACCCTTATTA CACCTACTCT GGCTCTATAC CCTACCTAGA TGGAACCTTT TACCTCAACC 
20881 ACACCTTTAA GAAGGTGGCC ATTACCTTTG ACTCTTCTGT CAGCTGGCCT GGCAATGACC 
20941 GCCTGCTTAC CCCCAACGAG TTTGAAATTA AGCGCTCAGT TGACGGGGAG GGTTACAACG 
21001 TTGCCCAGTG TAACATGACC AAAGACTGGT TCCTGGTACA AATGCTAGCT AACTACAACA 
21061 TTGGCTACCA GGGCTTCTAT ATCCCAGAGA GCTACAAGGA CCGCATGTAC TCCTTCTTTA 
21121 GAAACTTCCA GCCCATGAGC CGTCAGGTGG TGGATGATAC TAAATACAAG GACTACCAAC 
21181 AGGTGGGCAT CCTACACCAA CACAACAACT CTGGATTTGT TGGCTACCTT GCCCCCACCA 
21241 TGCGCGAAGG ACAGGCCTAC CCTGCTAACT TCCCCTATCC GCTTATAGGC AAGACCGCAG 
21301 TTGACAGCAT TACCCAGAAA AAGTTTCTTT GCGATCGCAC CCTTTGGCGC ATCCCATTCT 
21361 CCAGTAACTT TATGTCCATG GGCGCACTCA CAGACCTGGG CCAAAACCTT CTCTACGCCA 
21421 ACTCCGCCCA CGCGCTAGAC ATGACTTTTG AGGTGGATCC CATGGACGAG CCCACCCTTC 
21481 TTTATGTTTT GTTTGAAGTC TTTGACGTGG TCCGTGTGCA CCGGCCGCAC CGCGGCGTCA 
21541 TCGAAACCGT GTACCTGCGC ACGCCCTTCT CGGCCGGCAA CGCCACAACA TAAAGAAGCA 
21601 AGCAACATCA ACAACAGCTG CCGCCATGGG CTCCAGTGAG CAGGAACTGA AAGCCATTGT 
21661 CAAAGATCTT GGTTGTGGGC CATATTTTTT GGGCACCTAT GACAAGCGCT TTCCAGGCTT 
21721 TGTTTCTCCA CACAAGCTCG CCTGCGCCAT AGTCAATAOG GCCGGTCGCG A GACTG GGGG 
21781 CGTACACTGG ATGGCCTTTG CCTGGAACCC GCACTCAAAA ACAT GCTACC TCTTTGAGCC 
21841 CTTTGGCTTT TCTGACCAGC GACTCAAGCA GGTTTACCAG TTTGAGTACG AGTCACTCCT 
21901 GCGCCGTAGC GCCATTGCTT CTTCCCCCGA CCGCTGTATA ACGCTGGAAA AGTCCACCCA 
21961 AAGCGTACAG GGGCCCAACT CGGCCGCCTG TGGACTATTG TGCTGCATGT TTCTCCACGC 
22021 CTTTGCCAAC TGGCCCCAAA CTCCCATGGA TCACAACCCC ACCATGAACC TTATTACCGG 
22081 GGTACCCAAC TCCATGCTCA ACAGTCCCCA GGTACAGCCC ACCCTGCGTC GCAACCAGGA 
22141 ACAGCTCTAC AGCTTCCTGG AGCGCCACTC GCCCTACTTC CGCAGCCACA GTGCGCAGAT 
22201 TAGGAGCGCC ACTTCTTTTT GTCACTTGAA AAACATGTAA AAATAATGTA CTAGAGACAC 
22261 TTTCAATAAA GGCAAATGCT TTTATTTGTA CACTCTCGGG TGATTATTTA CCCCCACCCT 
22321 TGCCGTCTGC GCCGTTTAAA AATCAAAGGG GTTCTGCCGC GCATCGCTAT GCGCCACTGG 
22381 CAGGGACACG TTGCGATACT GGTGTTTAGT GCTCCACTTA AACTCAGGCA CAACCATCCG 
22441 CGGCAGCTCG GTGAAGTTTT CACTCCACAG GCTGCGCACC ATCACCAACG CGTT TAGC AQ 
22501 GTCGGGCGCC GATATCTTGA AGTCGCAOTT GGGGCCTCCG CCCTGCGCGC GCOAGTTGCG 
22561 ATACACAGGG TTGCAGCACT GGAACACTAT CAGC3GCCGGG TGGTGCAOGC TGGCCAGCAC 
22621 GCTCTTGTCG GAGATCAGAT CCGCGTCCAG GTCCTCCGOG TTGCTCAGGG CGAACGOAGT 
22681 CAACTTTGGT AGCTGCCTTC CCAAAAAGGG CGCGTGCCCA GGCTTTGAGT TGCACTCGCA 
22741 CCGTAGTGGC ATCAAAAGGT GACCGTGCCC GGTCTGGGCG TTAGGATACA GOGCCTGCAT 
22801 AAAAGCCTTG ATCTGCTTAA AAGCCACCTG AGCCTTTGOG CCTTCAGAGA AGAA CATGCC 
22861 GCAAGACTTG CCGGAAAACT GATTGGCCGG ACAGGCCGCO TCGTGCACGC AGCACCTTGC 
22921 GTCGGTGTTG GAGATCTGCA CCACATTTCG GCCCCACCGG TTCTTCACGA TCTTGGCCTT 
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22981 GCTAGACTGC TCCTTCAGCG CGCGCTGCCC GTTTTCGCTC GTCACATCCA TTTCAATCAC 
23041 GTGCTCCTTA TTTATCATAA TGCTTCCGTG TAGACACTTA AGCTCGCCTT CGATCTCAGC 
23101 GCAGCGGTGC AGCCACAACG CGCAGCCCGT GGGCTCGTGA TGCTTGTAGG TCACCTCTGC 
23161 AAACGACTGC AGGTACGCCT GCAGGAATCG CCCCATCATC GTCACAAAGG TCTTGTTGCT 
23221 GGTGAAGGTC AGCTGCAACC CGCGGTGCTC CTCGTTCAGC CAGGTCTTGC ATACGGCCGC 
232B1 CAGAGCTTCC ACTTGGTCAG GCAGTAGTTT GAAGTTCOCC TTTAGATCGT TATCCACGTG 
23341 GTACTTGTCC ATCAGCGCGC GCGCAGCCTC CATGCCCTTC TCCCACGCAG ACACGATCGG 
23401 CACACTCAGC GGGTTCATCA CCGTAATTTC ACTTTCCGCT TCGCTGGGCT CTTCCTCTTC 
23461 CTCTTGCGTC CGCATACCAC GCGCCACTGG GTCGTCTTCA TTCAGCCGCC GCACTGTGCG 
23521 CTTACCTCCT TTGCCATGCT TGATTAGCAC CGGTGGGTTG CTGAAACCCA CCATTTGTAG 
23581 CGCCACATCT TCTCTTTCTT CCTOGCTGTC CACGATTACC TCTGGTGATG GCGGGCGCTC 
23641 GGGCTTGGGA GAAGGGCGCT TCTTTTTCTT CTTGGGCGCA ATGGCCAAAT CCGCCGCCGA 
23701 GGTCGATGGC CGCGGGCTGG GTGTGCGCGG CACCAGCGCG TCTTGTGATG AGTCTTCCTC 
23761 GTCCTCGGAC TCGATACGCC GCCTCATCCG CTTTTTTGGG GGCGCCCGGG GAGGCGGCGG 
23821 CGACGGGGAC GGGGACGACA CGTCCTCCAT GGTTGGGGGA CGTCGCGCCG CACCGCGTCC 
23881 GCGCTCGGGG GTGGTTTOGC GCTGCTCCTC TTCCCGACTG GCCATTTCCT TCTCCTATAG 
23941 GCAGAAAAAG ATCATGGAGT CAGTCGAGAA GAAGGACAGC CTAACCGCCC CCTCTGAGTT 
24001 CGCCACCACC GCCTCCACCG ATGCCGCCAA CGCGCCTACC A CCTTCC CCG TCGAGGCACC 
24061 CCCGCTTGAG GAGGAGGAAG TGATTATCGA GCAGOACCCA GGTTTTGTAA GCGAAGACGA 
24121 CGAGGACCGC TCAGTACCAA CAGAGGATAA AAAGCAAGAC CAGGACAACG CAGAGGCAAA 
24181 GGAGGAACAA GTCGGGCGGG GGGACGAAAG GCATGGCGAC TACCTAGATG TGGGAGACGA 
24241 CGTGCTGTTG AAGCATCTGC AGCGCCAGTG CGCCATTATC TGCGACGCGT TGCAAGAGCG 
24301 CAGCGATCTG CCCCTCGCCA TAGCGGATGT CAGCCTTGCC TACGAACGCC ACCTATTCTC 
2436!L_ACCGCGCGTA CCCCCCAAAC GCCAAGAAAA CGGCACATGC GAGCCCAACC CGCGC CTCAA 
24421 CTTCTACCCC GTATTTGCCG TGCCAGAGGT GCTTGCCACC TATCACATCT TTTTCCAAAA 
24481 CTGCAAGATA CCCCTATCCT GCCGTGCCAA CCGCAGCCGA GCGGACAAGC AGCTGGCCTT 
24541 GCGGCAGGGC GCTGTCATAC CTGATATCGC CTCGCTCAAC GAAGTGCCAA AAATCTTTGA 
24601 GGGTCTTGGA CGCGACGAGA AGCGCGCGGC AAACGCTCTG CAACAGGAAA ACAGCGAAAA 
24661 TGAAAGTCAC TCTGGAGTGT TGGTGGAACT CGAGGGTGAC AAOGCGCGCC TAGCCGTACT 
24721 AAAACGCAGC ATCGAGGTCA CCCACTTTGC CTACCCGGCA CTTAACCTAC CCCCCAAGGT 
24781 CATGAGCACA GTCATGAGTG AGCTGATCGT GCGCCGTGCG CAGCCCCTGG AGAGGGATCC 
24841 AAATTTGCAA GAACAAACAG AGGAGGGCCT ACCCGCAGTT GGCGACOAGC AGCTAGCGCG 
24901 CTGGCTTCAA ACGCGCGAGC CTGCCGACTT GGAGGAGCGA CGCAAACTAA TGATGGCCGC 
24961 AGTGCTCGTT ACCGTGGAGC TTGAGTGCAT GCAGCGGTTC TTTCCTGACC CGGAGATGCA 
25021 GCGCAAGCTA GAGGAAACAT TGCACTACAC CTTTCGACAG GGCTACGTAC GCCAGGCCTG 
25081 CAAGATCTCC AACGTGGAGC TCTGCAACCT GGTCTCCTAC CTTGGAATTT TGCACGAAAA 
25141 CCGCCTTGGG CAAAACGTGC TTCATTCCAC GCTCAAGGGC GAGGCGCGCC GCGACTACGT 
25201 CCGCGACTGC GTTTACTTAT TTCTATGCTA CACCTGGCAG ACGGCCATGG GCGTTTGGCA 
25261 GCAGTGCTTG GAGGAGTGCA ACCTCAAGGA GCTGCAGAAA CTGCTAAAGC AAAAC TTGAA 
25321 GGACCTATGG ACGGCCTTCA ACGAGCGCTC CGTGGCCGCG CACCTGGCGG ACATCATTTT 
25381 CCCCGAACGC CTGCTTAAAA CCCTGCAACA GGGTCTGCCA GACTTCACCA GTCAAAGCAT 
25441 GTTGCAGAAC TTTAGGAACT TTATCCTAGA GCGCTCAGGA ATCTTGCCCG CC ACCTG CTG 
25501 TGCACTTCCT AGCGACTTTG TQCCCATTAA GTACCGCGAA TGCCCTCCGC CGCTTTGGGG 
25561 CCACTGCTAC CTTCTGCAGC TAGCCAACTA CCTTGCCTAC CACTCTGACA TAATGGAAGA 
25621 CGTGAGCGGT GACGGTCTAC TGGAGTGTCA CTGTCGCTGC AACCTATGCA CCCOGCACCG 
25681 CTCCCTGGTT TGCAATTCGC AGCTGCTTAA CGAAAGTCAA ATTATCGGTA CCTTTGAGCT 
25741 GCAGGGTCCC TCGCCTGACG AAAAGTCCGC GGCTCCGGGG TTGAAACTCA CTCCGGGGCT 
25801 GTGGACGTCG GCTTACCTTC GCAAATTTGT ACCTGAGGAC TACCACGCCC ACGAGATTAG 
25861 GTTCTACGAA GACCAATCCC GCCCGCCAAA TGCGGAGCTT ACCGCCTGCG TCATTACCCA 
25921 GGGCCACATT CTTGGCCAAT TGCAAGCCAT CAACAAAGCC CGCCAAGAGT TTCTGCTACG 
2S981 AAAOGOACGG GGGGTTTACT TGGACCCCCA GTCCGGCGAG GAGCTCAACC CAATCCCCCC 
26041 GCCGCCGCAG CCCTATCAGC AGCAGCCGCG GGCCCTTGCT TCCCAGGATG GCACCCAAAA 
26101 AGAAGCTGCA GCTGCCGCCO CCACCCACGG ACGAGGAGGA ATACTGGGAC AGTCAGGCAG 
26161 AGOAGGTTTT GGACGAGGAG GAGGAGGACA TGATGGAAGA CTGGGAGAGC CTAGACGAGG 
26221 AAGCTTCCGA GGTCGAAGAG GTGTCAGACG AAACACCGTC ACCCTCGGTC GCATTCCCCT 
26281 CGCCGGCGCC CCAGAAATCG GCAACCGGTT CCAGCATGGC TACAACCTCC GCTCCTCAGG 
26341 CGCCGCCGGC ACTGCCCGTT CGCCGACCCA ACCGTAGATG GGACACCACT GGAACCAGGG 
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26401 CCGGTAAGTC CAAGCAGCCG CCGCCGTTAG CCCAAGAGCA ACAACAGCGC CAAGGCTACC 
26461 GCTCATGGCG CGGGCACAAG AACGCCATAG TTGCTTGCTT GCAAGACTGT GGGGGCAACA 
26521 TCTCCTTCGC CCGCCGCTTT CTTCTCTACC ATCACGGCGT GGCCTTCCCC CGTAACATCC 
26581 TGCATTACTA CCGTCATCTC TACAGCCCAT ACTGCACCGG CGGCAGCGGC AGCGGCAGCA 
26641 ACAGCAGCGG CCACACAGAA GCAAAGGCGA CCGOATAGCA AGACTCTGAC AAAGCCCAAG 
26701 AAATCCACAG CGGCGGCAGC AGCAGGAGGA GGAGCGCTGC GTCTGGCGCC CAACGAACCC 
26761 GTATCGACCC GCGAGCTTAG AAACAGGATT TTTCCCACTC TGTATGCTAT ATTTCAACAG 
26821 AGCAGGGGCC AAGAACAAGA GCTGAAAATA AAAAACAGGT CTCTGCGATC CCTCACCCGC 
26881 AGCTGCCTGT ATCACAAAAG CGAAGATCAG CTTCGGCGCA CGCTGGAAGA CGOGGAGGCT 
26941 CTCTTCAGTA AATACTGCGC GCTGACTCTT AAGGACTAGT TTCGCGCCCT TTCTCAAATT 
27001 TAAGCGCGAA AACTACGTCA TCTCCAGCGG CCACACCCGG CGCCAGCACC TGTCGTCAGC 
27061 GCCATTATOA GCAAGGAAAT TCCCACGCCC TACATGTGGA GTTACCAGCC ACAAATGGGA 
27121 CTTGCGGCTG GAGCTGCCCA AGACTACTCA ACCCGAATAA ACTACATGAG CGCGGGACCC 
27181 CACATGATAT CCCGGGTCAA CGGAATCCGC GCCCACCGAA ACCGAATTCT CTTGGAACAG 
27241 GCGGCTATTA CCACCACACC TCGTAATAAC CTTAATCCCC GTAGTTGGCC CGCTGCCCTG 
27301 GTGTACCAGG AAAGTCCCGC TCCCACCACT GTGGTACTTC CCAGAGACGC CCAGGCCGAA 
27361 GTTCAGATGA CTAACTCAGG GGCGCAGCTT GOGGGCGGCT TTCGTCACAG GGTGCGGTCG 
27421 CCCGGGCAGG GTATAACTCA CCTGACAATC AGAGGGCGAG GTATTCAGCT CAACGACGAG 
27481 TCGGTGAGCT CCTCGCTTGG TCTCCGTCCG GACGGGACAT TTCAGATCGG CGGCGCCGGC 
27541 CGTCCTTCAT TCACGCCTCG TCAGGCAATC CTAACTCTGC AGACCTCGTC CTCTGAGCCG 
27601 CGCTCTGGAG GCATTGGAAC TCTGCAATTT ATTGAGGAGT TTGTGCCATC GGTCTACTTT 
27661 AACCCCTTCT CGGGACCTCC CGGCCACTAT CCGGATCAAT TTATTCCTAA CTTTGACGCG 
27721 GTAAAGGACT CGGCGGAOGG CTACGACTGA TAATTAAGTG GAGAGGCAGA GCAACTGCGC 
27781 CTGAAACACC TGGTCCACTG TCGCCGCCAC AAGTGCTTTG CCCGCGACTC CGGTGAGTTT 
27841 TGCTACTTTG AATTGCCCGA GGATCATATC -GAGGATCTTT GTTGCCATCT CTGTGCTGAG 
27901 TATAATAAAT ACAGAAATTA AAATATACTG GGGCTCCTAT CGCCATCCTG TAAACGCCAC 
27961 CGTCTTCACC CGCCCAAGCA AACCAAGGCG AACCTTACCT GGTACTTTTA ACATCTCTCC 
28021 CTCTGTGATT TACAACAGTT TCAACCCAGA CGGAGTGAGT CTACGAGAGA ACCTCTCCGA 
28081 GCTCAGCTAC TCCATCAGAA AAAACACCAC CCTCCTTACC TGCCGGGAAC GTACCCTTAA 
28141 TTAAAAGTCA GGCTTCCTGG ATGTCAGCAT CTGACTTTGG CCAGCACCTG TCCCGOGGAT 
28201 TTGTTCCAGT CCAACTACAG CGACCCACCC TAACAGAGAT GACCAACACA ACCAACGCGG 
28261 CCGCCGCTAC CX3GACTTACA TCTACCACAA ATACACCCCA AGTTTCTGCC TTTGTCAATA 
28321 ACTGGGATAA CTTGGGCATG TGGTGGTTCT CCATAGCGCT TATGTTTGTA TGCCTTATTA 
28381 TTATGTGGCT CATCTGCTGC CTAAAGCGCA AACGCGCCCG ACCACCCATC TATAGTCCCA 
28441 TCATTGTGCT ACACCCAAAC AATGATGGAA TCCATAGATT GGACGGACTG AAACACATGT 
28501 TCTTTTCTCT TACAGTATGA TTAAATGAGA TTAATTAAGG AATTTCTGTC CAGTTTATTC 
28561 AGCAGCACCT CCTTGCCCTC CTCCCAGCTC TGGTATTGCA GCTTCCTCCT GGCTGCAAAC 
28621 TTTCTCCACA ATCTAAATGG AATGTCAGTT TCCTCCTGTT CCTGTCCATC CGCACCCACT 
28681 ATCTTCATGT TGTTGCAGAT GAAGCGCGCA AGACCGTCTG AAGATACCTT CAACCCCGTG 
28741 TATCCATATG ACACGGAAAC CGGTCCTCCA ACTGTGCCTT TTCTTACTCC TCCCTTTGTA 
28801 TCCCCCAATG GGTTTCAAGA GAGTCCCCCT GGGGTACTCT CTTTGCGCCT ATCCGAACCT 
28861 CTAGTTACCT CCAATGGCAT GCTTGOGCTC AAAATGGGCA ACGGCCTCTC TCTGGAOGAG 
28921 GCCGGCAACC TTACCTCCCA AAATGTAACC ACTGTGAGCC CACCTCTCAA AAAAACCAAG 
28981 TCAAACATAA ACCTGGAAAT ATCTGCACCC CTCACAGTTA CCTCAGAAGC CCTAACTGTG 
29041 GCTGCCGCCG CACCTCTAAT GGTCGOGGGC AACACACTCA CCATGCAATC ACAGGCCCCG 
29101 CTAACCGTGC ACGACTCCAA ACTTAGCATT GCCACCCAAG GACCCCTCAC AGTGTCAGAA 
29161 GGAAAGCTAG CCCTGCAAAC ATCAGGCCCC CTCACCACCA CCGATAGCAG TACCCTTACT 
29221 ATCACTGCCT CACCCCCTCT AACTACTGCC ACTGGTAGCT TGGGCATTGA CTTGAAAGAG 
29281 CCCATTTATA CACAAAATGG AAAACTAGGA CTAAAGTACG GGGCTCCTTT GCATGTAACA 
29341 GACGACCTAA ACACTTTGAC CGTAGCAACT GGTCCAGGTG TGACTATTAA TAATACTTCC 
29401 TTGCAAACTA AAGTTACTGG AGCCTTGGGT TTTGATTCAC AAGGCAATAT GCAACTTAAT 
29461 GTAOCAGGAG GACTAAGGAT TGATTCTCAA AACAGAOGCC TTATACTTGA TGTTAGTTAT 
29521 CCGTTTQATG CTCAAAACCA ACTAAATCTA AGACTAGGAC ABGGCCCTCT TTTTATAAAC 
29581 TCAGCCCACA ACTTGGATAT TAACTACAAC AAAGGCCTTT ACTTGTTTAC AGCTTCAAAC 
29641 AATTCCAAAA AGCTTGAGGT TAACCTAAGC ACTGCCAAGG GGTTQATGTT TGACGCTACA 
29701 GCCATAGCCA TTAATGCAGG AOATOGGCTT GAATTTGGTT CACCTAATGC ACCAAACACA 
29761 AATCCCCTCA AAACAAAAAT TGGCCATGGC CTAGAATTTG ATTCAAACAA GGCTATGGTT 
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29821 CCTAAACTAQ GAACTGGCCT TAGTTTTGAC AGCACAGGTG CCATTACAGT AGGAAACAAA 
29881 AATAATGATA AGCTAACTTT GTGGACCACA CCAGCTCCAT CTCCTAACTG TAGACTAAAT 
29941 GCAGAGAAAG ATGCTAAACT CACTTTGGTC TTAACAAAAT GTGGCAGTCA AATACTTGCT 
30001 ACAGTTTCAG TTTTGGCTGT TAAAGGCAGT TTGGCTCCAA TATCTGGAAC AGTTCAAAGT 
30061 GCTCATCTTA TTATAAGATT TGACGAAAAT GGAGTGCTAC TAAACAATTC CTT CCT GQ AC 
30121 CCAGAATATT GGAACTTTAG AAATGGAGAT CTTACTGAAG GCACAGCCTA TACAAACGCT 
30181 GTTGGATTTA TGCCTAACCT ATCAGCTTAT CCAAAATCTC ACOGTAAAAC TGCCAAAAGT 
30241 AACATTGTCA GTCAAGTTTA CTTAAACGGA OACAAAACTA AACCTGTAAC ACTAACCATT 
30301 ACACTAAACG 6TACACAGGA AACAGGASAC ACAACTCCAA GTGCATACTC TATGTCATTT 
30361 TCATGGGACT GGTCTGGCCA CAACTACATT AATGAAATAT TTGCCACATC CTCTTACACT 
30421 TTTTCATACA TTGCCCAAGA ATAAAGAATC GTTTGTGTTA TGTTTCAACG TGTTTATTTT 
30481 TCAATTGCAG AAAATTTCAA GTCATTTTTC ATTCAGTAGT ATAGCCCCAC CACCACATAG 
30541 CTTATACAGA TCACCGTACC TTAATCAAAC TCACAGAACC CTAGTATTCA ACCTGCCACC 
30601 TCCCTCCCAA CACACAGAGT ACACAGTCCT TTCTCCCCGG CTGGCCTTAA AAAGCATCAT 
30661 ATCATGGGTA ACAGACATAT TCTTAGGTGT TATATTCCAC ACGGTTTCCT GTCGAGCCAA 
30721 ACGCTCATCA GTGATATTAA TAAACTCCCC GGGCAGCTCA CTTAAGTTCA TGTOGCTGTC 
30781 CAGCTGCTGA GCCACAGGCT GCTGTCCAAC TTGCGGTTGC TTAACGGGCG GCGAAGGAGA 
30841 AGTCCACGCC TACATGGGGG TAGAGTCATA ATCGTGCATC AGGATAGGGC GGTGGTGCTG 
30901 CAGCAGCGCG CGAATAAACT GCTGCCGCCG CCGCTCCGTC CTGCAGGAAT ACAACATGGC 
30961 AGTGGTCTCC TCAGCGATGA TTCGCACCGC CCGCAGCATA AGGCGCCTTG TCCTCCGGGC 
31021 ACAGCAGCGC ACCCTGATCT CACTTAAATC AGCACAGTAA CTGCAGCACA GCACCACAAT 
31081 ATTGTTCAAA ATCCCACAGT GCAAGGCGCT GTATCCAAAG CTCATGGCGG GGACCACAGA 
31141 ACCCACGTGG CCATCATACC ACAAGCGCAG GTAGATTAAG TGGCGACCCC TCATAAACAC 
31201 GCTGGACATA AACATTACCT CTTTTGGCAT GTTGTAATTC ACCACCTCCC GGTACCATAT 
31261 AAACCTCTGA TTAAACATGG CGCCATCCAC CACCATCCTA AACCAGCTGG CCAAAACCTG " 
31321 CCCGCCGGCT ATACACTGCA GGGAACCGGG ACTGGAACAA TGACAGTGGA GAGCCCAGGA 
31381 CTCGTAACCA TGGATCATCA TGCTCGTCAT GATATCAATG TTGGCACAAC ACAGGCACAC 
31441 GTGCATACAC TTCCTCAGGA TTACAAGCTC CTCCCGCGTT AGAACCATAT CCCAGGGAAC 
31501 AACCCATTCC TGAATCAGCG TAAATCCCAC ACTGCAGGGA AGACCTCGCA CGTAACTCAC 
31561 GTTGTGCATT GTCAAAGTGT TACATTCGGG CAGCAGCGGA TGATCCTCCA GTATGGTAGC 
31621 GCGGGTTTCT GTCTCAAAAG GAGGTAGACG ATCCCTACTG TACGGAGTGC GCCGAGACAA 
31681 CCGAGATCGT GTTGGTCGTA GTGTCATGCC AAATGGAACG CCGGACGTAG TCATATTTCC 
31741 TGAAGCAAAA CCAGGTGCGG GCGTGACAAA CAGATCTGCG TCTCCGGTCT CGCCGCTTAG 
31801 ATCGCTCTGT GTAGTAGTTG TAGTATATCC ACTCTCTCAA AGCATCCAGG CGCCCCCTGG 
31861 CTTCGGGTTC TATGTAAACT CCTTCATGCG CCGCTGCCCT GATAACATCC ACCACCGCAG 
31921 AATAAGCCAC ACCCAGCCAA CCTACACATT CGTTCTGCGA GTCACACACG GGAGGAGCGG 
31981 GAAGAGCTGG AAGAACCATG TTTTTTTTTT TATTCCAAAA GATTATCCAA AACCTCAAAA 
32041 TGAAGATCTA TTAAGTGAAC GCGCTCCCCT CCGGTGGCGT GGTCAAACTC TACAGCCAAA 
32101 GAACAGATAA TGGCATTTGT AAGATGTTGC ACAATGGCTT CCAAAAGGCA AAOGGCCCTC 
32161 ACGTCCAAGT GGACGTAAAG GCTAAACCCT TCAGGGTGAA TCTCCTCTAT AAACATTCCA 
32221 GCACCTTCAA CCATGCCCAA ATAATTCTCA TCTCGCCACC TTCTCAATAT ATCTCTAAGC 
32281 AAATCCCGAA TATTAAGTCC GGCCATTGTA AAAATCTGCT CCAGAGOGCC CTCCACCTTC 
32341 AGCCTCAAGC AGCGAATCAT GATTGCAAAA ATTCAGGTTC CTCACAGACC TGTATAAGAT 
32401 TCAAAAGCGG AACATTAACA AAAATACCGC GATCCCGTAG GTCCCTTCGC AGGGCCAGCT 
32461 GAACATAATC GTGCAGGTCT GCACGGACCA GCGCGGCCAC TTCCCCGCCA GGAACCTTGA 
32521 CAAAAGAACC CACACTGATT ATGACACGCA TACTCGGAGC TATGCTAACC AGCGTAGCCC 
32581 CGATGTAAGC TTTGTTGCAT GGGCGGCGAT ATAAAATGCA AGGTGCTGCT CAAAAAATCA 
32641 GGCAAAGCCT CGCGCAAAAA AGAAAGCACA TCGTAGTCAT GCTCATGCAG ATAAAGGGAG 
32701 GTAAGCTCCG GAACCACCAC AGAAAAAGAC ACCATTTTTC TCTCAAACAT GTCTGCGGGT 
32761 TTCTGCATAA ACACAAAATA AAATAACAAA AAAACATTTA AACATTAGAA GCCTGTCTTA 
32821 CAACAGGAAA AACAACCCTT ATAAGCATAA GAOGGACTAC GGCCATGCCG GCGTGACCGT 
32881 AAAAAAACTG GTCACCGTGA TTAAAAAGCA CCACCGACAG CTCCTCGGTC ATGTCCGGAG 
32941 TCATAATGTA AGACTCGGTA AACACATCAG GTTGATTCAT CGGTCAGTGC TAAAAAGOGA 
33001 CCGAAATAGC CCGGGGGAAT ACAXACCCGC AGGCGTAGAG ACAACATTAC AGCCCCCATA 
33061 GGAGGTATAA CAAAATTAAT AGGAGAGAAA AACACATAAA CACCTGAAAA ACCCTCCTGC 
33121 CTAGGCAAAA TAGCACCCTC CCGCTCCAGA ACAACATACA GCGCTTCACA GCGGCAGCCT 
33181 AACAGTCAGC CTTACCAGTA AAAAAGAAAA CCTATTAAAA AAACACCACT CGACACGGCA 
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33241 CCAGCTCAAT CAGTCACAGT GTAAAAAAGG GCCAAGTGCA GAGCGAGTAT ATATAGGACT 
33301 AAAAAATGAC GTAACGGTTA AAGTCCACAA AAAACACCCA GAAAACCGCA CGCGAACCTA 
33361 CGCCCAGAAA CGAAAGCCAA AAAACCCACA ACTTCCTCAA ATCGTCACTT CCGTTTTCCC 
33421 ACGTTACGTA ACTTCCCATT TTAAGAAAAC TACAATTCCC AACACATACA AOTTACTCCG 
334 81 CCCTAAAACC TACQTCACCC GCCCCGTTCC CACGCCCCGC GCCACGTCAC AAACTCCACC 
33541 CCCTCATTAT CATATTGGCT TCAATCCAAA ATAAGGTATA TTATTGATGA TG 
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34341 bp 



DNA 



SYN 



06-FEB-1999 



REFERENCE 
AUTHORS 
JOURNAL 

FEATURES 
CDS 



ORGANISM 



Unknown. 

Unknown 

Unclassified. 

1 (bases 1 to 34341) 

Self 

Unpublished. 



BASE COUNT 
ORIGIN 



Location/Qualifiers 
1. .34341 
/gene*°KD3* 
/products » KD3 n 
7951 a 9671 c 9464 g 



7255 t 



1 CATCATCAAT AATATACCTT ATTTT G GATT GAAGCCAATA TGATAATGAO GGGGTGGA6T 
61 TTGTGACGTG GC6CGGGGCG TGGGAACGGG GCGGGTQACG TAGTAGTGTO GCGGAAGTGT 
121 GA T UTTGCAA GTGTGGCGGA ACACAT6TAA 6CGACGGATG TGGCAAAAGT GACGTTTTTG 
181 GTOTGCGCCG GTGTACACAG GAAGTGACAA TTTTCGCGCG GTTTTAGGCG GATGTTGTAG 
241 TAAATTTGGG CGTAACCGAG TAAGATTTGG CCATTTTCGC GGGAAAACTG AATAAGAGGA 
301 AGTGAAATCT GAATAATTTT GTGTTACTCA TAGCGCGTAA TATTTGTCTA GGGCCGCGGG 
361 GACTTTGACC GTTTACGTGG AGACTCGCCC AGGTGTTTTT CTCAGGTGTT TTCCGCGTTC 
421 CGGGTCAAAG TTGGCGTTTT ATTATTATAG TCAGCTGACG TGTAGTGTAT TTATACCCGG 
481 TGAGTTCCTC AAGAGGCCAC TCTTGAGTGC CAGCGAGTAG AGTTTTCTCC TCCGAGCCGC 
541 TCCGACACCG GGACTGAAAA TGAGACATGA GGTACTGGCT GATAATCTTC CACCTCCTAG 
601 CCATTTTGAA CCACCTACCC TTCACGAACT GTATGATTTA GACGTGACGG CCCCCQAAGA 
661 TCCCAACGAG GAGGCGGTTT CGCAGATTTT TCCCGACTCT GTAATGTTGG CGGTGCAGGA 
721 AGGGATTGAC TTACTCACTT TTCOGCCGGC GCCCGGTTCT CCGGAGCCGC CTCACCTTTC 
781 CCGGCAGCCC GAGCAGCCGG AGCAGAGAGC CTTGGGTCCG GTTTGCCACG AGGCTGGCTT 
841 TCCACCCAGT GACGACGAGG ATGAAOAGGG TGAGGAGTTT GTGTTAGATT ATGTGGAGCA 
901 CCCCGGGCAC GGTTGCAGGT CTTGTCATTA TCACCGGAGG AATACGGGGG ACCCAGATAT 
961 TATGTGTTCG CTTTGCTATA TGAGGACCTG TGGCATGTTT GTCTACAGTA AG TGAAAA TT 
1021 ATGGGCAGTG GGTGATAGAG TGGTGGGTTT GGTGTGGTAA TTTTTTTTTT AATTTTTACA 
1081 GTTTTGTGGT TTAAAGAATT TTGTATTGTG ATTTTTTTAA AAGGTCCTGT GTCTGAACCT 
1141 GAGCCTGAGC CCGAGCCAGA ACCGGAGCCT GCAAGACCTA CCCGCCGTCC TAAAATGGCG 
1201 CCTGCTATCC TGAGACGCCC GACATCACCT GTGTCTAGAG AATGCAATAG TAGTACGGAT 
1261 AGCTGTGACT CCGGTCCTTC TAACACACCT CCTGAGATAC ACCCGGTGGT CCCGCTGTGC 
1321 CCCATTAAAC CAGTTGCCGT GAGAGTTGGT GGGCGTCGCC AGGCTGTGGA ATGTATCGAG 
1381 GACTTGCTTA ACGAGCCTGG GCAACCTTTG GACTTGAGCT GTAAACGCCC CAGGCCATAA 
1441 GGTGTAAACC TGTGATTGCG TGTGTGGTTA ACGCCTTTGT TTGCTGAATG AGTTGATGTA 
1501 AGTTTAATAA AGGGTGAGAT AATGTTTAAC TTGCATGGCG TGTTAAATGG GGOGGGGCTT 
1561 AAAGGGTATA TAATGCGCCG TGGGCTAATC TTGGTTACAT CTGACCTCAT GGAGGCTTGG 
1621 QA OTQTTTG Q AAGATTTTTC TGCTOT G OGT AACTTGCTQQ AACAGAGCTC TAACAGTACC 
1681 TCTTUtflTri 1 GGAGGTTTCT GTGGGGCTCA TCCCAGGCAA AGTTAGTCTG CAGAATTAAG 
1741 GAGOATTACA AGTGGOAATT TGAAGAGCTT TTGAAATCCT GTGGTGAGCT GTTTGATTCT 
1801 TTGAATCTGG GTCACCAGGC GCTTTTCCAA GAGAAG GTCA TCAAGACTTT GGATTTTTCC 
1861 ACACCGGGGC GCGCTGCGGC TGCTGTTGCT TTTTTGAGTT TTATAAAGGA TAAATGGAGC 
1921 GAAGAAACCC ATCTGAGCGG GGGGTACCTG CTGGATTTTC TGGCCATGCA TCTGTGGAGA 
1981 GCGGTTGTGA GACACAAGAA TCGCCTGCTA CTGTTQTCTT CCGTCCGCCC GGCGATAATA 
2041 CCGACGGAGG AGCAGCAGCA GCAGCAGGAG GAAGCCAGGC GGCGGCGGCA GGAGCAGAGC 
2101 CCATGGAACC CGAGAGCCGG CCTGGACCCT CGGGAATGAA TGTTGTACAG GTGGCTGAAC 
2161 TGTATCCAGA ACTGAGACGC ATTTTGACAA TTACAGAGGA TGGGCAGGGG C TAAAQG GGG 
2221 TAAAGAGGGA GCGGGGGGCT TGTGAGGCTA CAGAGGAGGC TAGOAATCTA GCTTTXAGCT 
2281 TAATGACCAG ACACCGTCCT GAGTGTATTA CTTTTCAACA GATCAAGGAT AATTGCGCTA 
2341 ATGAGCTTGA TCTGCTGGCG CAGAAGTATT CCATAGAGCA GCTGACCACT TACTGGCTGC 
2401 AGCCAGGGGA TGATTTTGAG GAGGCTATTA GGGTATATGC AAAGGTGGCA CTTAGGCCAG 
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2461 ATTGCAAGTA CAAGATCAGC AAACTTGTAA ATATCAGGAA TTGTTGCTAC ATTTCTGGGA 
2521 ACGGGGCCGA GGTGGAGATA GATACGOAGG ATAGGGTGGC CTTTAGATGT AGCATGATAA 
2581 ATATGTGGCC GGGGGTGCTT GGCATGGACG GGGTGGTTAT TATGAATGTA AGGTTTACTG 
2641 GCCCCAATTT TAGCGGTACG GTTTTCCTGG CCAATACCAA CCTTATCCTA CACGGTGTAA 
2701 GCTTCTATGG GTTTAACAAT ACCTGTGTGG AAGCCTGGAC CGATGTAAGG GTTCGGGGCT 
2761 GTGCCTTTTA CTGCTGCTGG AAGGGGGTGG TGTGTCGCCC CAAAAGCAGG GCTTCAATTA 
2821 AGAAATGCCT CTTTGAAAGG TGTACCTTGG GTATCCTGTC TOAGGGTAAC T CCAGGG TGC 
2881 GCCACAATGT GOCCTCCGAC TGTGGTTGCT TCATGCTAGT GAAAAGCGTG GCTGTGATTA 
2941 AGCATAACAT GGTATGTGGC AACTGCGAGG ACAGGGCCTC TCAGATGCTG ACCTGCTCGG 
3001 ACGGCAACTG TCACCTGCTG AAGACCATTC ACGTAGCCAG CCACTCTCGC AAGGCCTGGC 
3061 CAGTGTTTGA GCATAACATA CTGACCCGCT GTTCCTTGCA TTTGGGTAAC AGOAGGGGGG 
3121 TGTTCCTACC TTACCAATGC AATTTGAGTC ACACTAAGAT ATTGCTTGAG CCCGAGAGCA 
3181 TGTCCAAGGT GAACCTGAAC GGGGTGTTTG ACATGACCAT GAAGATCTGG AAGGTGCTGA 
3241 GGTACGATGA GACCCGCACC AGGTGCAOAC CCTGCGAGTG TCGCGG TAAA CATATTAGGA 
3301 ACCAGCCTGT GATGCTGGAT GTGACCGAGG AGCTGAGGCC CGATCACTTG QTGCTGGCCT 
3361 GCACCCGCGC TGAGTTTGGC TCTAGCGATG AAGATACAGA TTGAGGTACT GAAATG TQTG 
3421 GGCGTGGCTT AAGGGTGGGA AAGAATATAT AAGGTGGGGG TCTTATGTAG TTTTGTATCT 
3481 GTTTTCCAGC AGCCGCCGCC GCCATGAGCA CCAACTCGTT TGATGGAAGC ATTGTGAGCT 
3541 CATATTTGAC AACGCGCATG CCCCCATGGG CCGGGGTGCG TCAGAATGTG ATGGGCTCCA 
3601 GCATTGATGG TCGCCCCGTC CTGCCCGCAA ACTCTACTAC CTTGACCTAC GAGACCGTGT 
3661 CTGGAACGCC GTTGGAGACT GCAGCCTCCG CCGCCGCTTC AGCCGCTGCA GCCACCGCCC 
3721 GCGGGATTGT GACTGACTTT GCTTTCCTGA GCCOGCTTGC AAGCAGTGCA GCTTCCCGTT 
3781 CATCCGCCCG CGATGACAAG TTGACGGCTC TTTTGGCACA ATTGGATTCT TTGACCCGGG 
3841 AACTTAATGT CGTTTCTCAG CAGCTGTTGG ATCTGCGCCA GCAGGTTTCT GCCCT GAAGG 
3901 CTTCCTCCCC TCCCAATGCG GTTTAAAACA TAAATA AAAA ACCAGACTCT GTTTGGATTT 
3961 GGATCAAGCA AGTGTCTTGC TGTCTTTATT TAGGGGTTTT GCGCGCGCGG TAGGCCCGGG 
4021 ACCAGCGGTC TCGGTCGTTG AGGGTCCTGT GTATTTTTTC CAGGACGTGG TAAAGGTGAC 
4081 TCTGGATGTT CAGATACATG GGCATAAGCC CGTCTCTGGG GTGGAGGTAG CACCACTGCA 
4141 GAGCTTCATG CTCCGGGGTG GTGTTGTAGA TGATCCAGTC GTAGCAGGAG CGCTGGGCGT 
4201 GGTGCCTAAA AATGTCTTTC AGTAGCAAGC TGATTGCCAG GGGCAGGCCC TTGGTGTAAG 
4261 TGTTTACAAA GCGGTTAAGC TGGGATGGGT GCATACGTGG GGATATGAGA TGCATCTTGG 
4321 ACTGTATTTT TAGGTTGGCT ATGTTCCCAG CCATATCCCT CCGGGGATTC ATGTTGTGCA 
4381 GAACCACCAG CACAGTGTAT CCGGTGCACT TGGGAAATTT Q TCAT GTAGC TTAGAAGGAA 
4441 ATGCGTGGAA GAACTTGGAG ACGCCCTTGT GACCTCCAAG ATTTTCCATG CATTCGTCCA 
4501 TAATGATGGC AATGGGCCCA CGGGCGGCGG CCTGGGCGAA GATATTTCTG GGATCACTAA 
4561 CGTCATAGTT GTGTTCCAGG ATGAGATCGT CATAGGCCAT TTTTACAAAG CGCGGGCGGA 
4621 GGGTGCCAGA CTGCGGTATA ATGGTTCCAT CCGGCCCAGG GGCGTAGTTA CCCTCACAGA 
4681 TTTGCATTTC CCACGCTTTG AGTTCAGATG GGGGGATCAT GTCTACCTGC GGGGCGATGA 
4741 AGAAAACGGT TTCCGGGGTA GGGGAGATCA GCTGGGAAGA AAGCAGGTTC CTGAGCAGCT 
4801 GCGACTTACC GCAGCCGGTG GGCCCGTAAA TCACACCTAT TACCGGGTGC AACTGGTAGT 
4861 TAAGAGAGCT GCAGCTGCCG TCATCCCTGA GCAGGGGGGC CACTTCGTTA AGCATGTCCC 
4921 TGACTCGCAT GTTTTCCCTG ACCAAATCCG CCAGAAGGCG CTCGCCGCCC AGCGAT AGCA 
4981 GTTCTTGCAA GGAAGCAAAG TTTTTCAACG GTTTGAGACC GTCCGCCGTA GGCATGCTTT 
5041 TGAGCGTTTG ACCAAGCAGT TCCAGGCGGT CCCACAGCTC GGTCAC CTGC TCTACGGCAT 
5101 CTCGATCCAG CATATCTCCT CGTTTOGOGG GTTGGGGCGG CTTTCGCTGT ACGGCAGTAG 
5161 TCGGTGCTCG TCCAGACGGG CCAGGGTCAT GTCTTTCCAC GGGCGCAGGG TCCTCOTCAG 
5221 CGTAGTCTGG GTCACGGTOA AGGGGTGCGC TCCGGGCTGC GCGCTGGCCA GGGTGCGCTT 
5281 GAGGCTGGTC CTGCTGGTGC TGAAGCGCTG CCGOTCTTCG CCCTGCGOGT CGGCCAGGTA 
5341 GCATTTGACC ATGGTGTCAT AGTCCAGCCC CTCCGCGGCG TGGCCCTTGG CGCGCAGCTT 
5401 GCCCTTGOAG GAGGCGCCGC ACOAGGGGCA GTGCAGACTT TTOAGGGCGT AGAGCTTGGG 
5461 CGCGAGAAAT ACCGATTCCG GGGAGTAGGC ATCCGCGCCG CAGGCCCCGC AGACGGTCTC 
5521 GCATTCCACG AGCCAGGTGA GCTCTGGCCG TTCGGGGTCA AAAACCAGGT TTCCCCCATG 
5581 CTTTTTGATG CGTTTCTTAC CTCTGGTTTC CATGAGCCGG TGTCCACGCT 03GTGACQAA 
5641 AAGGCTGTCC GTGTCCCCGT ATACAGACTT GAGAGGCCTG TCCTCGAGCG GTGTTCCGCG 
5701 GTCCTCCTCG TATAGAAACT CGOACCACTC TGAGACAAAG GCTCGCGTCC AGGCCAGCAC 
5761 GAAGGAGGCT AAGTGGGAGG GGTAGCGGTC GTTGTCCACT AGGGGGTCCA CTOSCTCCAG 
5821 GGTGTGAAGA CACATGTCGC CCTCTTCGGC ATCAAGGAAG GTGATTGGTT TGTAGGTGTA 
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58B1 GGCCACGTGA CCGGGTGTTC CTGAAGGGGG GCTATAAAAG GGGGTGGGGG 
5941 CTCACTCTCT TCCGCATCGC TGTCTGCGAG GGCCAGCTGT TGGGQTGA6T ACTCCCTCTG 
6001 AAAAGCGGGC ATGACTTCTG CGCTAAGATT GTCAGTTTCC AAAAACGAGG AGGATTTGAT 
6061 ATTCACCTGG CCCGCGGTGA TGCCTTTGAG GGTGGCCGCA TCCATCTGGT CAGAAAAGAC 
6121 AATCTTTTTG TTGTCAAGCT TGGTGGCAAA CGACCCGTAG AGGGCGTTGG ACAGCAACTT 
6181 GGCGATGGAG CGCAGGGTTT GGTTTTTGTC GCGATCGGCG CGCTCCTTGG CCGC3GATGTT 
6241 TAGCTGCACG TATTCGCGCG CAAOGCACCG CCATTCGGGA AAGACGGTGG TGCGCTCGTC 
6301 GGGCACCAGG TGCACGCGCC AACCGCGGTT GTGCAGGGTG ACAAGGTCAA CGCTGGTGGC 
6361 TACCTCTCCG CGTAGGCGCT CGTTGGTCCA GCAGAGGCGG CCGCCCTTGC GCGAGCAGAA 
6421 TGGCGGTAGG GGGTCTAGCT GCGTCTCGTC CGGGGGGTCT GCGTCCACGG TAAAGACCCC 
6461 GGGCAGCAGG CGCGCGTCGA AGTAGTCTAT CTTGCATCCT TGCAAGTCTA GOGCCTGCTG 
6541 CCATGCGCGG GCGGCAAGCG CGCGCTCGTA TGGGTTGAGT GGGGGACCCC ATGGCATGGG 
6601 GTGGGTGAGC GCGGAGGCGT ACATGCCGCA AATGTCGTAA ACGTAGAGGG GCTCTCTGAG 
6661 TATTCCAAGA TATGTAGGGT AGCATCTTCC ACCGCGGATG CTGGCGCGCA CGTAATCGTA 
6721 TAGTTCGTGC GAGGGAGCGA GGAGGTCGGG ACCGAGGTTG CTACGGGCGG GCTGCTCTOC 
6781 TCGGAAGACT ATCTGCCTGA AGATGGCATG TGAGTTGGAT GATATGGTTG GACGCTGGAA 
6841 GACGTTGAAG CTGGCGTCTG TGAGACCTAC CGCGTCAOGC ACGAAGGAGG CGTAGGAGTC 
6901 GCGCAGCTTG TTGACCAGCT CGGCGGTGAC CTGCACGTCT AGGGCGCAGT AGTCCAGGGT 
6961 TTCCTTGATG ATGTCATACT , TATCCTGTCC CTTTTTTTTC CACAGCTCGC GGTTGAGGAC 
7021 AAACTCTTCG OGGTCTTTCC AGTACTCTTG GATCGGAAAC CCGTOGGCCT CCOAACGGTA 
7081 AGAGCCTAGC ATGTAGAACT GGTTGACGGC CTGGTAGGCG CAGCATCCCT TTTCTACGGG 
7141 TAGCGCGTAT GCCTGCGCGG CCTTCCGGAG CGAGGTGTGG GTGAGCGCAA AGGTGTCCCT 
7201 GACCATGACT TTGAGGTACT GGTATTTGAA GTCAGTGTCG TCGCATCCGC CCTGCTCCCA 
7261 GAGCAAAAAG TCCGTGCGCT TTTTGGAACG CGGATTTGGC AGGGCGAAGG TGACATCGTT 
7321 GAAGAGTATC TTTCCCGCGC GAGGCATAAA GTTGCGTGTG ATGCGGAAGG GTCCCGGCAC 
7381 CTCGGAACGG TTGTTAATTA CCTGGGCGGC GAGCACGATC TCGTCAAAGC CGTTGATGTT 
7441 GTGGCCCACA ATGTAAAGTT CCAAGAAGCG CGGGATGCCC TTGATGGAAG GCAATTTTTT 
7501 AAGTTCCTCG TAGGTGAGCT CTTCAGGGGA GCTGAGCCCG TGCTCTGAAA GGGCCCAGTC 
7561 TGCAAGATGA GGGTTGGAAG CGACGAATGA GCTCCACAGG TCACGGGCCA TTAGCATTTG 
7621 CAGGTGGTCG CGAAAGGTCC TAAACTGGCG ACCTATGGCC ATTTTTTCTG GGGTGATGCA 
7681 GTAGAAGGTA AGCGGGTCTT GTTCCCAGCG GTCCCATCCA AGGTTCGCGG CTAGGTCTCG 
7741 CGCGGCAGTC ACTAGAGGCT CATCTCCGCC GAACTTCATG ACCAGCATGA AGGGCACGAG 
7801 CTGCTTCCCA AAGGCCCCCA TCCAAGTATA GGTCTCTACA TCGTAGGTGA CAAAGAGACG 
7861 CTCGGTGCGA GGATGCGAGC CGATCGGGAA GAACTGGATC TCCCGCCACC AATTGGAGGA 
7921 GTGGCTATTG ATGTGGTGAA AGTAGAAGTC CCTGCGAOGG GCCGAACACT CGTGCTGGCT 
7981 TTTGTAAAAA CGTGCGCAGT ACTGGCAGCG GTGCACGGGC TGTACATCCT GCACGAGGTT 
8041 GACCTGACGA CCGCGCACAA GGAAGCAGAG TGGGAATTTG AGCCCCTCGC CTGGCGGGTT 
8101 TGGCTGGTGG TCTTCTACTT CGGCTGCTTG TCCTTGACCG TCTGGCTGCT CGAGGGGAGT 
8161 TACGGTGGAT CGGACCACCA CGCOGCGCGA GCCCAAAGTC CAGATGTCCG CGOGCGGCGG 
8221 TCGGAGCTTG ATGACAACAT CGCGCAGATG GGAGCTGTCC ATGGTCTGGA GCTCCCGCGG 
8281 CGTGAGGTCA GGCGGGAGCT CCTGCAGGTT TACCTCGCAT AGAOGGGTCA GGGOGCGGGC 
8341 TAGATCCAGG TGATACCTAA TTTCCAGGGG CTGGTTGGTG GCGGCGTCGA TGGCTTGCAA 
8401 GAGGCCGCAT CCCCGCGGCG CGACTACGGT ACCGCGCGGC GGGOGGTGGG CCGCGGGGGT 
8461 GTCCTTGGAT GATGCATCTA AAAGCGGTGA CGCGGGCGAG CCCCCGGAGG TAGGGGGGGC 
8521 TCCGGACCCG CCGGGAGAGG GGGCAGGGGC ACGTCGGCGC CGGGCGCGGG CAGGAGCTGG 
8581 TGCTGCGCGC GTAGGTTGCT GGCGAACGCG ACGACGCGGC GGTTGATCTC CTGAATCTGG 
8641 CGCCTCTGCG TGAAGACGAC GGGCCCGGTG AGCTTOAGCC TGAAAGAGAG TTCGACAGAA 
8701 TCAATTTCGG TGTCGTTGAC GGCGGCCTGG CGCAAAATCT CCTGCACGTC TCCTGAGTTG 
8761 TCTTGATAGG CGATCTCGGC CATGAACTGC TCGATCTCTT CCTCCTGGAG ATCTCCGCGT 
8821 CCGGCTCGCT CCACGGTGGC GGCGAGGTCG TTGGAAATGC GGGCCATGAG CTGCGAGAAG 
8881 GCGTTGAGGC CTCCCTCGTT CCAGACGCGG CTGTAGACCA CGCCCCCTTC GGCATCGCGG 
8941 GCGCGCATGA CCACCTGCGC GAGATTGAGC TCCACGTGCC GGGCGAAGAC GGCGTAOTTT 
9001 CGCAGGCGCT GAAAGAGGTA GTTGAGGGTO GTGGCGGTGT GTTCTGCCAC GAAGAAGTAC 
9061 ATAACCCAGC GTCGCAACGT GGATTCGTTG ATATCCCCCA AGGCCTCAAG GCGCTCCATG 
9121 GCCTCGTAGA AGTCCACGGC GAAGTTGAAA AACTGGGAGT TGOGCGCCGA CACGGTTAAC 
9181 TCCTCCTCCA GAAGACGGAT GAGCTCGGCG ACAGTGTCGC GCACCTCGCG CTCAAAGGCT 
9241 ACAGGGGCCT CTTCTTCTTC TTCAATCTCC TCTTCCATAA GGGCCTCCCC TTCTTCTTCT 



FIGURE 23 
(SHEET 3) 



WO 01/04282 



PCT/US00/18971 



9301 TCTGGCGGCG GTGGGGGAGG GGGGACACGG CGGCGACGAC GGCGCACCGG GAGGCGGTCG 
9361 ACAAAGCGCT CGATCATCTC CCCGOGGCGA CGGCGCATGG TCTCGGTGAC G GCGC GGCCG 
9421 TTCTCGCGGG GGCGCAGTTG GAAGACGCCG CCCGTCATGT CCCGGTTATG GGTTGGCGOG 
9481 GGGCTGCCAT GCGGCAGGGA TACGGCGCTA ACGATGCATC TCAACAATTG TTGTGTAGGT 
9541 ACTCCGCCGC CGAGGGACCT GAGCGAGTCC GCATCGACCG GATCGGAAAA CCTCTCGAGA 
9601 AAGGCGTCTA ACCAGTCACA GTCGCAAGGT AGGCTGAGCA CCGTGGCGGG CGGCAGOGGG 
9661 CGGCGGTCGG GGTTGTTTCT GGCGGAGGTQ CTGCTGATGA TGTAATTAAA GTAGGCGGTC 
9721 TTGAGACGGC GGATGGTCGA CAGAAGCACC ATGTCCTTGG GTCCGG CCTQ CTGAATGCGC 
9781 AGGCGGTCGG CCATGCCCCA GGCTTCGTTT TGACATCGGC GCAGGTCTTT GTAGTAQTCT 
9841 TGCATGAGCC TTTCTACCGG CACTTCTTCT TCTCCTTCCT CTTGTCCTGC ATCTCTTGCA 
9901 TCTATCGCTG CGGCGGCGGC GGAGTTTGGC CGTAGGTGGC GCCCTCTTCC TCCCATGCGT 
9961 GTGACCCCGA AGCCCCTCAT CGGCTGAAGC AGGGCTAGGT CGGCGACAAC GCGCTOGGCT 
10021 AATATGGCCT GCTGCACCTG CGTGAGGGTA GACTGGAAGT CATCCATGTC CACAAAGCGG 
10081 TGGTATGCGC CCGTGTTGAT GGTGTAAGTG CAGTTGGCCA TAACGGACCA GTTAACGGTC 
10141 TGGTGACCCG GCTGCGAGAG CTCGGTGTAC CTGAGACGCG AGTAAGCCCT CGAGTCAAAT 
10201 ACGTAGTCGT TGCAAGTCCG CACCAGGTAC TGGTATCCCA CCAAAAAGTG CGGCGGCGGC 
10261 TGGCGGTAGA GGGGCCAGCG TAGGGTGGCC GGGGCTCCGG GGGCGAOATC TTCCAACATA 
10321 AGGCGATGAT ATCCGTAGAT GTACCTGGAC ATCCAGGTGA TGCOGOCGGC GGTGGTGGAG 
10381 GCGCGCGGAA AGTCGCGGAC GCGOTTCCAG ATGTTGCGCA GCGGCAAAAA GTGCTCCATG 
10441 GTCGGGACGC TCTGGCCGGT CAGGCGOGCG CAATCGTTGA CGCTCTAGCG TGCAAAAGGA 
10501 GAGCCTGTAA GCGGGCACTC TTCCGTGGTC TGGTGGATAA ATTCGCAAGG GTATCATGGC 
10561 GGACGACCGG GGTTCGAGCC CCGTATCCGG CCGTCCGCCG TGATCCATGC GGTTACCGCC 
10621 CGCGTGTCGA ACCCAGGTGT GCGACGTCAG ACAACGGGGG AGTGCTCCTT TTGGCTTCCT 
10681 TCCAGGCGCG GCGGCTGCTG CGCTAGCTTT TTTGGCCACT GGCCGCGCGC AQCGT AAGCG 
10741 GTTAGGCTGG AAAGCGAAAG CATTAAGTGG CTCGCTCCCT GTAGCCGGAG GGTTATTTTC 
10801 CAAGGGTTGA GTCGCGGGAC CCCCGGTTCG AGTCTCGGAC CGGCCGGACT GCGGCGAACG 
10861 GGGGTTTGCC TCCCCGTCAT GCAAGACCCC GCTTGCAAAT TCCTCCGGAA ACAGGGACGA 
10921 GCCCCTTTTT TGCTTTTCCC AGATGCATCC GGTGCTGCGG CAGATGOGCC CCCCTCCTCA 
10981 GCAGCGGCAA GAGCAAGAGC AGCGGCAGAC ATGCAGGGCA CCCTCCCCTC CTCCTACCGC 
11041 GTCAGGAGGG GCGACATCCG CGGTTGACGC GGCAGCAGAT GGTGATTACG AACCCCCGCG 
11101 GCGCCGGGCC CGGCACTACC TGGACTTGGA GGAGGGOGAG GGCCTGGCGC GGCTAGGAGC 
11161 GCCCTCTCCT GAGCGGTACC CAAGGGTGCA GCTGAAGCGT GATACGCGTG AGGOGTACGT 
11221 GCCGCGGCAG AACCTGTTTC GCGACCGCGA GGGAGAGGAG CCCGAGGAGA TGCGGGATCG 
11281 AAAGTTCCAC GCAGGGCGCG AGCTGCGGCA TGGCCTGAAT CGCGAGCGGT TGCTGCGCGA 
11341 GGAGGACTTT GAGCCCGACG CGCGAACCGG GATTAGTCCC GCGCGCGCAC ACGTGGCGGC 
11401 CGCCGACCTC GTAACCGCAT ACGAGCAGAC GGTGAACCAG GAGATTAACT TTCAAAAAAG 
11461 CTTTAACAAC CACGTGCGTA CGCTTGTGGC GOGCGAGGAG GTGGCTATAG GACTGATGCA 
11521 TCTGTGGGAC TTTGTAAGCG CGCTGGAGCA AAACCCAAAT AGCAAGCCGC TCATGGOGCA 
11581 GCTGTTCCTT ATAGTGCAGC ACAGCAGGGA CAACGAGGCA TTCAGGGATG CGCTGCTAAA 
11641 CATAGTAGAG CCCGAGGGCC GCTGGCTGCT CGATTTGATA AACATCCTGC AGAGCATAGT 
11701 GGTGCAGGAG CGCAGCTTGA GCCTGGCTGA CAAGGTGGCC GCCATCAACT ATTCCATGCT 
11761 TAGCCTGGGC AAGTTTTACG CCCGCAAGAT ATACCATACC CCTTACGTTC CCATAGACAA 
11821 GGAGGTAAAG ATCGAGGGGT TCTACATGCG CATGGOGCTG AAGGTGCTTA CCTTGAGCGA 
11881 CGACCTGGGC GTTTATCGCA ACGAGCGCAT CCACAAGGCC GTOAGCGTGA GCCGGCGGCG 
11941 CGAGCTCAGC GACCGCGAGC TGATGCACAG CCTGCAAAGG GCCCTGGCTG GCAOGGGCAG 
12001 CGGCGATAGA GAGGCCGAGT CCTACTTTGA OGCGGGCGCT GACCTGOGCT GGGCCCCAAG 
12061 CCGACGCGCC CTGGAGGCAG CTGGGGCCGG ACCTGGGCTG GCGGTGGCAC CCGCGCGCGC 
12121 TGGCAACGTC GGCGGCGTGG AGGAATATGA CGAGGACGAT GAGTACGAGC CAGAGGAOGG 
12181 CGAGTACTAA GCGGTGATGT TTCTGATCAG ATGATGCAAG ACGCAACGGA CCOGGOGGTG 
12241 CGGGCGGCGC TGCAGAGCCA GCCGTCCGGC CTTAACTCCA CGGACGACTG GO0CCAGOTC 
12301 ATGOACOGCA TCATGTCGCT GACTGCGCGC AATCCTGACG CGTTCCGGCA GCAGCOGCAG 
12361 GCCAACCGGC TCTCCGCAAT TCTGGAAGCG GTGGTCCCGG CGCGCGCAAA CCCCAOGCAC 
12421 GAGAAGGTGC TCGCGATCGT AAACGOGCTG GCCGAAAACA GGGCCATCCG GCCCX3AOGM 
12481 GCCGGCCTGG TCTACGAOGC GCTGCTTCAG CGCGTGGCTC GTTACAACAG CGGCAACOTQ 
12541 CAGACCAACC TGGACCGGCT GGTGGGGGAT GTGGGGGAGG CCGTGGCGCA GCGTOAGCGC 
12601 GCGCAGCAGC AGGGCAACCT GGGCTCCATG GTTGCACTAA ACGCCTTCCT GAGTACACAG 
12661 CCCGCCAACG TGCCGCGGGG ACAGGAGOAC TACACCAACT TTGTGAGCGC ACTGCGGCTA 
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12721 ATGGTGACTG AOACACCGCA AAGTGAGGTG TACCAGTCTG GGCCAGACTA TTTTTTCCAG 
12781 ACCAGTAOAC AAGGCCTGCA GACCGTAAAC CTGA6CCAGG CTTTCAAAAA CTTGCAGGGG 
12841 CTGTGGGGGG TGCGGGCTCC CACAGGCGAC CGCGCGACCG TQTCTAGCTT GCTGACGCCC 
12901 AACTCGCGCC TGTTGCTGCT GCTAATAGCG CCCTTCACGG ACAGTGGCAG CGTGTCCCGG 
12961 GACACATACC TAGGTCACTT GCTGACACTG TACCGCGAGG CCATAGGTCA GGCGCATGTG 
13021 GACGAGCATA CTTTCCAGGA GATTACAAGT GTCAGCCGCG CGCTGGGGCA GGAGGACACG 
13081 GGCAGCCTGG AGGCAACCCT AAACTACCTG CTGACCAACC GGCGGCAGAA GATCCCCTCG 
13141 TTGCACAGTT TAAACAGCGA GGAGGAOOGC ATTTTGCGCT ACGTGCAGCA GAGCGTGAGC 
13201 CTTAACCTGA TGCGCGACGG GGTAACGCCC AGCGTGGCGC TGGACATGAC CGCGCGCAAC 
13261 ATGGAACCGG GCATGTATGC CTCAAACCGG CCGTTTATCA ACCGCCTAAT GGACTACTTG 
13321 CATCGCGOGG CCGCCOTGAA CCCCGAGTAT TTCACCAATG CCATCTTGAA CCCGCACTGG 
13381 CTACCGCCCC CTGGTTTCTA CACCGOGGGA TTCGAGGTGC CCGAGGOTAA CGATGGATTC 
13441 CTCTGGGACG ACATAGACGA CAGCGTGTTT TCCCCGCAAC CGCAGACCCT GCTAGAGTTG 
13501 CAACAGCGCG AGCAGGCAGA GGCGGCGCTG CGAAAGGAAA GCTTCCGCAG GCCAAGCAGC 
13561 TTGTCCGATC TAGGCGCTGC GGCCCOGCGG TCAGATGCTA GTAGCCCATT TCCAAGCTTG 
13621 ATAGGGTCTC TTACCAGCAC TCGCACCACC CGCCCGCGCC TGCTGGGCGA GGAGGAGTAC 
13681 CTAAACAACT CGCTGCTGCA GCCGCAGCGC GAAAAAAACC TGCCTCCGGC ATTTCCCAAC 
13741 AACGGGATAG AGAGCCTAGT GGACAAGATG AGTAGATGGA AGACGTACGC GCAGGAGCAC 
13801 AGGGACGTGC CAGGCCCGCG CCCGCCCACC CGTCGTCAAA GGCACGACCG TCAGCGGGGT 
13861 CTGGTGTGGG AGGACGATGA CTOGGCAGAC GACAGCAGCG TCCTGGATTT GGGAGGGAGT 
13921 GGCAACCCGT TTGCGCACCT TCGCCCCAGG CTGGGGAGAA TGTTTTAAAA AAAAAAAAGC 
13981 ATGATGCAAA ATAAAAAACT CACCAAGGCC ATGGCACCGA GCGTTGGTTT TCTTGTATTC 
14041 CCCTTAGTAT GCGGCGCGCG GCGATGTATG AGGAAGGTCC TCCTCCCTCC TACGAGAGTG 
14101 TGGTGAGCGC GGCGCCAGTG GCGGCGGCGC TGGGTTCTCC CTTCGATGCT CCCCTGGACC 
14161 CGCCGTTTGT GCCTCCGCGG TACCTGCGGC CTACCGGGGG GAGAAACAGC ATCCGTTACT 
14221 CTGAGTTGGC ACCCCTATTC GACACCACCC GTGTGTACCT GGTGGACAAC AAGTCAACGG 
14281 ATGTGGCATC CCTGAACTAC CAGAACGACC ACAGCAACTT TCTGACCACG GTCATTCAAA 
14341 ACAATGACTA CAGCCCGGGG GAGGCAAGCA CACAGACCAT CAATCTTGAC GACCGGTCGC 
14401 ACTGGGGCGG CGACCTGAAA ACCATCCTGC ATACCAACAT GCCAAATGTG AACGAGTTCA 
14461 TGTTTACCAA TAAGTTTAAG GCGCG6GTGA TGGTGTCGCG CTTGCCTACT AAGGACAATC 
14521 AGGTGGAGCT GAAATACGAG TGGGTGGAGT TCACGCTGCC CGAGGGCAAC TACTCCGAGA 
14581 CCATGACCAT AGACCTTATG AACAACGCGA TCGTGGAGCA CTACTTGAAA GTGGGCAGAC 
14641 AGAACGGGGT TCTGGAAAGC GACATCGGGG TAAAGTTTGA CACCCGCAAC TTCAGACTGG 
14701 GGTTTGACCC CGTCACTGGT CTTGTCATGC CTGGGGTATA TACAAACGAA GCCTTCCATC 
14761 CAGACATCAT TTTGCTGCCA GGATGCGGGG TGGACTTCAC CCACAGCCGC CTGAGCAACT 
14821 TGTTGGGCAT CCGCAAGCGG CAACCCTTCC AGGAGGGCTT TAGOATCACC TACGATGATC 
14881 TGGAGGGTGG TAACATTCCC GCACTGTTGG ATGTGGACGC CTACCAGGCG AGCTTGAAAG 
14941 ATGACACCGA ACAGGGCGGG GGTGGCGCAG GCGGCAGCAA CAGCAGTGGC AGCGGCGCGG 
15001 AAGAGAACTC CAACGCGGCA GCCGCGGCAA TGCAGCCGGT GGAGGACATG AACGATCATG 
15061 CCATTCGCGG CGACACCTTT GCCACACGGG CTGAGGAGAA GCGCGCTGAG GCCGAAGCAG 
15121 CGGCCGAAGC TGCCGCCCCC GCTGCGCAAC CCGAGGTCGA GAAGCCTCAG AAGAAACCGG 
15181 TGATCAAACC CCTGACAGAG OACAGCAAGA AACGCAGTTA CAACCTAATA AGCAATGACA 
15241 GCACCTTCAC CCAGTACCGC AGCTGGTACC TTGCATACAA CTACGGOGAC CCTCAOACCG 
15301 GAATCCGCTC ATGGACCCTG CTTTGCACTC CTGACGTAAC CTGC6GCTCG GAGCAGGTCT 
15361 ACTGGTCGTT GCCAGACATQ ATGCAAGACC CCGTGACCTT COGCTCCACG CGCCAOATCA 
15421 GCAACTTTCC GGTGGTGGGC GCCGAGCTGT TGCCCGTGCA CTCCAAOAGC TTCTACAACG 
15481 ACCAGGCCGT CTACTCCCAA CTCATCCGCC AGTTTACCTC TCTGACCCAC GTGTTCAATC 
15541 GCTTTCCOGA GAACCAOATT TTGGCGCGCC CGCCAGCCCC CACCATCACC ACCGTCAGTG 
15601 AAAACGTTCC TGCTCTCACA GATCACGGGA CGCTACCGCT GCGCAACAGC ATCGGAGGAG 
15661 TCCAGCGAGT GACCATTACT GACGCCAGAC GCCGCACCTG CCCCTACGTT TACAAGGCCC 
15721 TGGGCATAGT CTCGCCGCGC GTCCTATCGA GCCGCACTTT TTGAQCAAGC ATGTCCATCC 
15781 TTATATCGCC CAGCAATAAC ACAGGCTGGG GCCTGCGCTT CCCAAGCAAG ATGTTTGGCG 
15841 GGGCCAAGAA GGGCTCCGAC CAACACCCAG TGCGCGTGCG CGGGCACTAC CGCGCGCCCT 
15901 GGGGCGCGCA CAAACGCGGC CGCACTGGGC GCACCACCGT CGATOACGCC ATCGACGCGG 
15961 TGGTGGAGGA GGCGCGCAAC TACACGCCCA CGCCGCCACC AOTGTCCACA GTGGACGCGG 
16021 CCATTCAGAC CGTGGTQCGC GGAGCCOGGC GCTATGCTAA AATOAAOAGA CGGCGGAGGC 
16081 GOGTAGCACG TCGCCACCGC CGCOOACCCG GCACTGCCGC CCAACGCGCG GCGGCGGCGC 
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.6141 TGCTTAACCG CGCACGTCGC ACCGGCCGAC GGGCGGCCAT GCGGGCCGCT CGAAGGCTGG 
.6201 CCGCGGGTAT TGTCACTGTG CCCCCCAGGT CCAGGCGACG AGCGGCCGCC GCAGCAGCCG 
.6261 CGGCCATTAG TGCTATGACT CAGGGTCGCA GGGGCAACGT GTATTGGGTG CGCGACTCGG 
.6321 TTAGCGGCCT GCGCGTGCCC GTGCGCACCC GCCCCCCGCG CAACTAGATT GCAAGAAAAA 
.6381 ACTACTTAGA CTCGTACTGT TGTATGTATC CAGCGGOGGC GGCGCGCAAC GAAGCTATGT 
.6441 CCAAGCGCAA AATCAAAGAA GAGATGCTCC AGGTCATCGC GCCGGAGATC TATGGCCCCC 
.6501 CGAAGAAGGA AGAGCAGGAT TACAAGCCCC GAAAGCTAAA GCGGGTCAAA AAGAAAAAGA 
.6561 AAGATGATGA TGATGAACTT GACGACGAGG TGGAACTGCT GCACGCTACC GCGCCCAGGC 
.6621 GACGGGTACA GTGGAAAGGT CGACGCGTAA AACGTGTTTT GCGACCCGGC ACCACCGTAG 
.6681 TCTTTACGCC CGGTGAGCGC TCCACCCGCA CCTACAAGCG CGTGTATGAT GAGGTGTACG 
.6741 GCGACGAGGA CCTGCTTGAG CAGGCCAACG AGCGCCTCGG GGAGTTTGCC TACGGAAAGC 
.6801 GGCATAAGGA CATGCTGGCG TTGCCGCTGG ACGAGGGCAA CCCAACACCT AGCCTAAAGC 
.6861 CCGTAACACT GCAGCAGGTG CTGCCCGCGC TTGCACCGTC CGAAGAAAAG CGCGGCCTAA 
.6921 AGCGCGAGTC TGGTGACTTG GCACCCACCG TGCAGCTGAT GGTACCCAAG CGCCAGCGAC 
.6981 TGGAAGATGT CTTGGAAAAA ATGACCGTGG AACCTGGGCT GGAGCCCGAG GTCCGCGTGC 
.7041 GGCCAATCAA GCAGGTGGCG CCGGGACTGG GCGTGCAGAC CGTGGACGTT CAGATACCCA 
.7101 CTACCAGTAG CACCAGTATT GCCACCGCCA CAGAGGGCAT GGAGACACAA ACGTCCCCGG 
.7161 TTGCCTCAGC GGTGGCGGAT GCCGCGGTGC AGGCGGTOGC TGCGGCCGCG TCCAAGACCT 
.7221 CTACGGAGGT GCAAACGGAC CCGTGGATGT TTCGCGTTTC AGCCCCCCGG CGCCCGCGCG 
.7281 GTTCGAGGAA GTACGGCGCC GCCAGCGCGC TACTGCCOGA ATATOCCCTA CATCCTTCCA 
.7341 TTGCGCCTAC CCCCGGCTAT CGTGGCTACA CCTACCGCCC CAGAAGACGA GCAACTACCC 
.7401 GACGCCGAAC CACCACTGGA ACCCGCCGCC GCCGTCGCCG TCGCCAGCCC GTGCTGGCCC 
.7461 CGATTTCCGT GCGCAGGGTG GCTCGCGAAG GAGGCAGGAC CCTGGTGCTG CCAACAGCGC 
.7521 GCTACCACCC CAGCATCGTT TAAAAGCCGG TCTTTGTGGT TCTTGCAGAT ATGGCCCTCA 
.7581 CCTGCCGCCT CCGTTTCCCG GTGCCGGGAT TCCGAGGAAG AATGCACCGT AGGAGGGGCA 
.7641 TGGCCGGCCA CGGCCTGACG GGCGGCATGC GTCGTGCGCA CCACCGGCGG CGGCGCGCGT 
.7701 CGCACCGTCG CATGCGCGGC GGTATCCTGC CCCTCCTTAT TCCACTGATC GCCGCGGCGA 
.7761 TTGGCGCCGT GCCCGGAATT GCATCCGTGG CCTTGCAGGC GCAGAGACAC TGATTAAAAA 
.7821 CAAGTTGCAT GTGGAAAAAT CAAAATAAAA AGTCTGGACT CTCACGCTCG CTTGGTCCTG 
.7881 TAACTATTTT GTAGAATGGA AGACATCAAC TTTGOGTCTC TGGCCCCGCG ACACGGCTCG 
.7941 CGCCOGTTCA TGGGAAACTG GCAAGATATC GGCACCAGCA ATATGAGCGG TGGCGCCTTC 
.8001 AGCTGGGGCT CGCTGTGGAG CGGCATTAAA AATTTCGGTT CCACCGTTAA GAACTATGGC 
.8061 AGCAAGGCCT GGAACAGCAG CACAGGCCAG ATGCTGAGGG ATAAGTTGAA AGAGCAAAAT 
.8121 TTCCAACAAA AGGTGGTAGA TGGCCTGGCC TCTGGCATTA GCGGGGTGGT GGACCTGGCC 
8181 AACCAGGCAG TGCAAAATAA GATTAACAGT AAGCTTGATC CCCGCCCTCC CGTAGAGGAG 
.8241 CCTCCACCGG CCGTGGAGAC AGTGTCTCCA GAGGGGCGTG GCGAAAAGCG TCCGCGCCCC 
.8301 GACAGGGAAG AAACTCTGGT GACGCAAATA GACGAGCCTC CCTCGTACGA GGAGGCACTA 
.8361 AAGCAAGGCC TGCCCACCAC CCGTCCCATC GCGCCCATGG CTACCGGAGT GCTGGGCCAG 
.8421 CACACACCCG TAACGCTGGA CCTGCCTCCC CCCGCCGACA CCCAGCAGAA ACCTGTGCTG 
.8481 CCAGGCCCGA CCGCCGTTGT TGTAACCCGT CCTAGCCGCG CGTCCCTGCG CCGCGCCGCC 
.8541 AGCGGTCCGC GATCGTTGCG GCCCGTAGCC AGTGGCAACT GGCAAAGCAC ACTGAACAGC 
.8601 ATCGTGGGTC TGGGGGTGCA ATCCCTGAAG CGCCGACGAT GCTTCTGAAT AGCTAACGTG 
.8661 TCGTATGTGT GTCATGTATG CGTCCATGTC GCCGCCAGAG GAGCTGCTGA GCOGCCGCGC 
.8721 GCCCGCTTTC CAAGATGGCT ACCCCTTCGA TGATGCCGCA GTGGTCTTAC ATGCACATCT 
8781 CGGGCCAGGA CGCCTCGGAG TACCTGAGCC CCGGGCTGGT GCAGTTTGCC CGCGCCACCG 
8841 AGACGTACTT CAGCCTGAAT AACAAGTTTA GAAACCCCAC GGTGGCGCCT ACGCACGACG 
.8901 TGACCACAGA CCGGTCCCAG CGTTTGACGC TGCGGTTCAT CCCTGTGGAC CGTGAGGATA 
8961 CTGCGTACTC GTACAAGGCG CGGTTCACCC TAGCTGTGGG TGATAACCGT GTGCTGGACA 
9021 TGGCTTCCAC GTACTTTGAC ATCCGCGGCG TGCTGGACAG GGGCCCTACT TTEAAGCCCT 
9081 ACTCTGGCAC TGCCTACAAC GCCCTGGCTC CCAAGGGTGC CCCAAATCCT TGCGAATGGG 
9141 ATGAAGCTGC TACTGCTCTT GAAATAAACC TAGAAGAAGA GGACGATGAC AACGAAGACG 
9201 AACTAGACGA GCAAGCTGAG CAGCAAAAAA CTCACGTATT TGGGCAGGCG CCTXATTCTG 
9261 GTATAAATAT TACAAAGOAG GGTATTCAAA TAGGTGTCGA AGGTCAAACA CCTAAATATG 
.9321 CCGATAAAAC ATTTCAACCT GAACCTCAAA TAGGAGAATC TCAGTGGTAC GAAACTGAAA 
.9381 TTAATCATGC AGCTGGGAGA GTCCTTAAAA AGACTACCCC AATGAAACCA TGTTACGGTT 
9441 CATATGCAAA ACCCACAAAT GAAAATGGAG GGCAAGGCAT TCTTOTAAAG CAACAAAATG 
9501 GAAAGCTAGA AAGTCAAGTG GAAATGCAAT TTTTCTCAAC TACTGAGGCG ACCGCAGGCA 
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19561 ATGGTGATAA CTTGACTCCT AAAGTGGTAT TGTACAGTGA AGATGTAGAT ATAQAAACCC 
19621 CAGACACTCA TATTTCTTAC ATGCCCACTA TTAAGOAAGG TAACTCACGA GAACTAATGG 
19681 GCCAACAATC TATGCCCAAC AGGCCTAATT ACATTGCTTT TAGGGACAAT TTTATTGGTC 
19741 TAATGTATTA CAACAGCACG GGTAATATGG GTGTTCTGGC GGGCCAAGCA TCGCAGTTGA 
19B01 ATGCTGTTGT AGATTTGCAA GACAGAAACA CAGAGCTTTC ATACCAGCTT TTGCTTGATT 
19861 CCATTGGTQA TAQAACCAGG TACTTTTCTA TGTGGAATCA GGCTGTTGAC AGCTATGATC 
19921 CAGATGTTAG AATTATTGAA AATCATGGAA CTGAAGATGA ACTTCCAAAT TACTGCTTTC 
19981 CACTGGGAGG TGTGATTAAT ACAGAGACTC TTACCAAGGT AAAACCTAAA ACAGGTCAGG 
20041 AAAATGGATG GGAAAAAGAT GCTACAGAAT TTTCAGATAA AAATGAAATA AGAGTTGGAA 
20101 ATAATTTTGC CATGGAAATC AATCTAAATG CCAACCTGTG GAGAAATTTC CTGTACTCCA 
20161 ACATAGCGCT GTATTTGCCC GACAAGCTAA AGTACAGTCC TTCCAACGTA AAAATTTCTG 
20221 ATAACCCAAA CACCTACGAC TACATGAACA AGCGAGTGGT GGCTCCCGGG TTAGTGGACT 
20281 GCTACATTAA CCTTGGAGCA CGCTGGTCCC TTGACTATAT GGACAACGTC AACCCATTTA 
20341 ACCACCACCG CAATGCTGGC CTGCGCTACC GCTCAATGTT GCTGGGCAAT GGTCGCTATG 
20401 TGCCCTTCCA CATCCAGGTG CCTCAGAAGT TCTTTGCCAT TAAAAACCTC CTTCTCCTGC 
20461 CGGGCTCATA CACCTACGAG TGGAACTTCA GGAAGGATGT TAACATOGTT CTGCAGAGCT 
20521 CCCTAGGAAA TGACCTAAGG GTTGACGGAG CCAGCATTAA GTTTGATAGC ATTTGCCTTT 
20581 ACGCCACCTT CTTCCCCATG GCCCACAACA CCGCCTCCAC GCTTGAGGCC ATGCTTAGAA 
20641 ACGACACCAA CGACCAGTCC TTTAAOGACT ATCTCTCCGC CGCCAACATG CTCTACCCTA 
20701 TACCCGCCAA CGCTACCAAC GTGCCCATAT CCATCCCCTC CCGCAACTGG GCGGCTTTCC 
20761 GCGGCTGGGC CTTCACGCGC CTTAAOACTA AGGAAACCCC ATCACTGGGC TCGGGCTACG 
20821 ACCCTTATTA CACCTACTCT GGCTCTATAC CCTACCTAGA TGGAACCTTT TACCTCAACC 
20881 ACACCTTTAA GAAGGTGGCC ATTACCTTTG ACTCTTCTGT CAGCTGGCCT GGCAATGACC 
20941 GCCTGCTTAC CCCCAACGAG TTTGAAATTA AGCGCTCAGT TGACGGGGAG GGTTACAACG 
21001 TTGCCCAGTG TAACATGACC AAAGACTGGT TCCTGGTACA AATGCTAGCT AACTACAACA 
21061 TTGGCTACCA GGGCTTCTAT ATCCCAGAGA GCTACAAGGA CCGCATGTAC TCCTTCTTTA 
21121 GAAACTTCCA GCCCATOAGC CGTCAGGTGG TGGATGATAC TAAATACAAG GACTACCAAC 
21181 AGGTGGGCAT CCTACACCAA CACAACAACT CTGGATTTGT TGGCTACCTT GCCCCCACCA 
21241 TGCGCGAAGG ACAGGCCTAC CCTGCTAACT TCCCCTATCC GCTTATAGGC AAGACCGCAG 
21301 TTGACAGCAT TACCCAGAAA AAGTTTCTTT GCGATCGCAC CCTTTGGCGC ATCCCATTCT 
21361 CCAGTAACTT TATGTCCATG GGCGCACTCA CAGACCTGGG CCAAAACCTT CTCTACGCCA 
21421 ACTCCGCCCA CGCGCTAGAC ATGACTTTTG AGGTGGATCC CATGGACGAG CCCACCCTTC 
21481 TTTATGTTTT GTTTGAAGTC TTTGACQTGG TCCGTGTGCA CCQGCCGCAC CGCGGCGTCA 
21541 TCGAAACCGT GTACCTGCGC ACGCCCTTCT CGGCCGGCAA CGCCACAACA TAAAGAAGCA 
21601 AGCAACATCA ACAACAGCTG CCGCCATGGG CTCCAGTGAG CAGGAACTGA AAGCCATTGT 
21661 CAAAGATCTT GGTTGTGGGC CATATTTTTT GGGCACCTAT GACAAGCGCT TTCCAGGCTT 
21721 TGTTTCTCCA CACAAGCTCG CCTGCGCCAT AGTCAATACG GCCGGTCGCG AGACTGGGGQ 
21781 CGTACACTGG ATGGCCTTTG CCTGGAACCC GCACTCAAAA ACATGCTACC TCTTTGAGCC 
21841 CTTTGGCTTT TCTGACCAGC GACTCAAGCA GGTTTACCAG TTTGAGTACG AGTCACTCCT 
21901 GCGCCGTAGC GCCATTGCTT CTTCCCCCGA CCGCTGTATA ACGCTGGAAA AGTCCACCCA 
21961 AAGCGTACAG GGGCCCAACT CGGCCGCCTG TGGACTATTC TGCTGCATGT TTCTCCACGC 
22021 CTTTGCCAAC TGGCCCCAAA CTCCCATGGA TCACAACCCC ACCATGAACC TTATTACCGG 
22081 GGTACCCAAC TCCATGCTCA ACAGTCCCCA GGTACAGCCC ACCCTGCGTC GCAACCAGGA 
22141 ACAGCTCTAC AGCTTCCTGG AGCGCCACTC GCCCTACTTC CGCAGCCACA GTGCGCAGAT 
22201 TAGGAGCGCC ACTTCTTTTT GTCACTTGAA AAACATGTAA AAATAATGTA CTAGAGACAC 
22261 TTTCAATAAA GGCAAATGCT TTTATTTGTA CACTCTCGGG TGATTATTTA CCCCCACCCT 
22321 TGCCGTCTGC GCCGTTTAAA AATCAAAGGG GTTCTGCCGC GCATCGCTAT GCGCCACTGG 
22381 CAGGGACACG TTGCGATACT GGTGTTTAGT GCTCCACTTA AACTCAGGCA CAACCATCCG 
22441 CGGCAGCTCG GTGAAGTTTT CACTCCACAO GCTGCGCACC ATCACCAACG CGTTTAGCAG 
22501 GTCGGGCGCC GATATCTTGA AGTCGCAGTT GGGGCCTCCG CCCTGCGCGC GCCttGTTGCG 
22561 ATACACAGGG TTGCAGCACT GGAACACTAT CAOCGCCGGO TGGTGCACGC TGGCCAGCAC 
22621 GCTCTTGTCG GAGATCAGAT CCGCGTCCAG GTCCTCCGCG TTGCTCAGGG CGAACGOAGT 
22681 CAACTTTGGT AGCTGCCTTC CCAAAAAGGG CGCGTGCCCA GGCTTTGAGT TGCACTCGCA 
22741 CCGTAGTGGC ATCAAAAGGT GACCGTGCCC GGTCTGGGCG TTAGGATACA GCGCCTGCAT 
22801 AAAAGCCTTG ATCTGCTTAA AAGCCACCTG AGCCTTTGCQ CCTTCAGAGA AGAACATGCC 
22861 GCAAGACTTG CCGGAAAACT GATTGGCCGG ACAGGCCGCG TCGTGCACGC AGCACCTTOC 
22921 GTCGGTGTTG GAOATCTGCA CCACATTTCG GCCCCACCGG TTCTTCACGA TCTTGGCCTT 
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22981 GCTAOACTGC TCCTTCAGCG CGCGCTGCCC 
23041 GTGCTCCTTA TTTATCATAA TGCTTCCGTG 
23101 GCAOCGGTGC AGCCACAACG CGCAGCCCGT 
23161 AAACGACTGC AGGTACGCCT GCAGGAATCG 
23221 GGTGAAGGTC AGCT6CAACC CGCGGT6CTC 
23281 CAGAGCTTCC ACTTGGTCAG GCAGTAGTTT 
23341 GTACTTGTCC ATCAGCGCGC GCGCAGCCtC 
23401 CACACTCAGC GGGTTCATCA CCGTAATTTC 
23461 CTCTTGCGTC CGCATACCAC GCGCCACTGG 
23521 CTTACCTCCT TTGCCATGCT TGATTAGCAC 
23581 CGCCACATCT T CTCTTTCTT CCTCGCTGTC 
23641 GGGCTTGGGA GAAGGGCGCT TCTTTTTCTT 
23701 GGTCGATGGC CGCGGGCTGG GTGTGCGCGG 
23761 GTCCTCGGAC TCOATACGCC GCCTCATCCG 
23821 CGAOGGGGAC GGGGACGACA CGTCCTCCAT 
23881 GCGCTCGGGG GTGGTTTCGC GCTGCTCCTC 
23941 GCAGAAAAAG ATCATGGAGT CAGTCGAGAA 
24001 CGCCACCACC GCCTCCACCG ATGCCGCCAA 
24061 CCCGCTTGAG GAGGAGGAAG TGATTATCGA 
24121 CGAGGACCGC TCAGTACCAA CAGAGGATAA 
24181 CGAGGAACAA GTCGGGCGGG GGGACGAAAG 
24241 CGTGCTGTTG AAGCATCTGC AGCGCCAGTG 
24301 CAGCGATGTG CCCCTCGCCA TAGCGGATGT 
24361 ACCGCGCGTA CCCCCCAAAC GCCAAGAAAA 
24421 CTTCTACCCC GTATTTGCCG TGCCAGAGGT 
24481 CTGCAAGATA CCCCTATCCT GCCGTGCCAA 
24541 GCGGCAGGGC GCTGTCATAC CTGATATCGC 
24601 GGGTCTTGGA CGCGACGAGA AGCGCGCGGC 
24661 TGAAAGTCAC TCTGGAGTGT TGGTGGAACT 
24721 AAAACGCAGC ATCGAGGTCA CCCACTTTGC 
24781 CATGAGCACA GTCATGAGTG AGCTGATCGT 
24841 AAATTTGCAA GAACAAACAG AGGAGGGCCT 
24901 CTGGCTTCAA ACGCGCGAGC CTGCCGACTT 
24961 AGTGCTCGTT ACCGTGGAGC TTGAGTGCAT 
25021 GCGCAAGCTA GAGGAAACAT TGCACTACAC 
25081 CAAGATCTCC AACGTGGAGC TCTGCAACCT 
25141 CCGCCTTGGG CAAAACGTGC TTCATTCCAC 
25201 CCGCGACTGC GTTTACTTAT TTCTATGCTA 
25261 GCAGTGCTTG GAGGAGTGCA ACCTCAAGGA 
25321 GGACCTATGG ACGGCCTTCA ACQAGCGCTC 
25381 CCCCGAACGC CTGCTTAAAA CCCTGCAACA 
25441 GTTGCAOAAC TTTAGGAACT TTATCCTAGA 
25501 TGCACTTCCT AGCGACTTTG TGCCCATTAA 
25561 CCACTGCTAC CTTCTGCAGC TAGCCAACTA 
25621 CGTGAGCGGT GAOGGTCTAC TGGAGTGTCA 
25681 CTCCCTGGTT TGCAATTCGC AGCT G CT TA A 
25741 GGAGGGTCCC TCGCCTGACG AAAAGTCCGC 
25801 GTGGACGTCG GCTTACCTTC GCAAATTTGT 
25861 GTTCTACGAA GACCAATCCC GCCCGCCAAA 
25921 GGGCCACATT CTTGGCCAAT TGCAAGCCAT 
25981 AAAGGGACGG GGGGTTTACT TGGACCCCCA 
26041 GCCGCCGCAG CCCTATCAGC AGCAGCCGCG 
26101 AGAAGCTGCA GCTGCOGCCG CCACCCACGG 
26161 AGGAGGTTTT GGACGAGGAG GAGGAGGACA 
26221 AAGCTTCCGA GGTCGAAGAG GTGTCAGACG 
26281 CGCOGGOGCC CCAOAAATCG GCAACCGGTT 
26341 CGCCGCCGGC ACTGCCCGTT CGCCGACCCA 



GTTTTCGCTC GTCACATCCA TTTCAATCAC 
TAGACACTTA AGCTCGCCTT CGATCTCAGC 
GGGCTCGTGA TGCTTGTAGG TCACCTCTGC 
CCCCATCATC GTCACAAAGG TCTTGTTGCT 
CTCGTTCAGC CAGGTCTTGC ATACGGCCGC 
GAAGTTCGCC TTTAGATCGT TATCCACGTG 
CATGCCCTTC TCCCACGCAG ACACGATCGG 
ACTTTCCGCT TCGCTGGGCT CTTCCTCTTC 
GTCGTCTTCA TTCAGCOGCC GCACTGTGCG 
CGGTGGGTTG CTGAAACCCA CCATTTGTAG 
CACGATTACC TCTGGTGATG GOGGGCGCTC 
CTTGGGCGCA ATGGCCAAAT COGCOGCCGA 
CACCAGCGCG TCTTGTGATG AGTCTTCCTC 
CTTTTTTGGG GGCGCCCGGG GAGGCGGCGG 
GGTTGGGGGA CGTCGCGCCG CACCGCGTCC 
TTCCCGACTG GCCATTTCCT TCTCCTATAG 
GAAGGACAGC CTAACCGCCC CCTCTGAGTT 
CGCGCCTACC ACCTTCCCCG TCGAGGCACC 
GCAGGACCCA GGTTTTGTAA GCGAA3ACGA 
AAAGCAAGAC CAGGACAACG CAGAGGCAAA 
GCATGGCGAC TACCTAGATG TGGGAGACGA 
CGCCATTATC TGCGACGCGT TGCAAGAGCG 
CAGCCTTGCC TACGAACGCC ACCTATTCTC 
CGGCACATGC GAGCCCAACC CGCGCCTCAA 
GCTTGCCACC TATCACATCT TTTTCCAAAA 
CCGCAGCCGA GCGGACAAGC AGCTGGCCTT 
CTCGCTCAAC GAAGTGCCAA AAATCTTTGA 
AAACGCTCTG CAACAGGAAA ACAGCGAAAA 
CGAGGGTGAC AACGCGCGCC TAGCCGTACT 
CXACCOGGCA CTTAACCTAC CCCCCAAGGT 
GCGCCGTGCG CAGCCCCTGG AGAGGGATGC 
ACCCGCAGTT GGCGACGAGC AGCTAGCGCG 
GGAGGAGCGA CGCAAACTAA TGATGGCCGC 
GCAGCGGTTC TTTGCTGACC CGGAGATGCA 
CTTTCGACAG GGCTACGTAC GCCAGGCCTG 
GGTCTCCTAC CTTGGAATTT TGCACGAAAA 
GCTCAAGGGC GAGGCGCGCC GCGACTACGT 
CACCTGGCAG ACGGCGATGG GC QTTTG GCA 
GCTGCAGAAA CTGCTAAAGC AAAACTTGAA 
CGTGGCCGCG CACCTGGCGO ACATCATTTT 
GGGTCTGCCA GACTTCACCA GTCAAAGCAT 
GCGCTCAGGA ATCTTGCCCG CCACCTGCTG 
GTACCGOGAA TGCCCTCOGC CGCTTTGGGG 
CCTTGCCIAC CACTCTGACA TAATGGAAGA 
CTGTCGCTGC AACCTATGCA CCCCGCACCG 
CGAAAGTCAA ATTATCGGTA CCTTTGAGCT 
GGCTCCGGGG TTGAAACTCA CTCCGGGGCT 
ACCTGAGGAC TACCACGCCC ACOAGATTAG 
TGCGGAGCTT ACCGCCTGCG TCATTACCCA 
CAACAAAGCC CGCCAAGAGT TTCTGCTACG 
GTCCGGOGAG GAGCTCAACC CAATCCCCCC 
GGC C CTT G CT TCCCAGGATG GCACCCAAAA 
ACGAGGAGGA ATACTGGGAC AGTCAGGCAG 
TGATGGAAGA CTGGGAGAGC CTAGACGAGG 
AAACACCGTC ACCCTCGGTC GCATTCCCCT 
CCAGCATGGC TACAACCTCC GCTCCTCAGG 
ACCGTAGATG GGACACCACT GGAACCAGGG 
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26401 CCGGTAAGTC CAAGCAGCCG CCGCCGTTAO CCCAAGAGCA ACAACAGCGC CAAQOCTACC 
26461 GCTCATGGCG OGGGCACAAG AACGCCATAG TTGCTTGCTT GCAAGACTGT GGGGGCAACA 
26521 TCTCCTTCGC CCX3CCGCTTT CTTCTCTACC ATCACGGCGT GGCCTTCCCC CGTAACATCC 
26581 TGCATTACTA CCGTCATCTC TACAGCCCAT ACTGCACCGG CGGCAGCGGC AGCGGCAGCA 
26641 ACAGCAGCGG CCACACAGAA GCAAAGGCGA CCGGATAGCA AGACTCTQAC AAAGCCCAAG 
26701 AAATCCACAG CGGCGGCAGC AGCAGGAGGA GGAGCGCTGC GTCTGGCGCC CAACGAACCC 
26761 GTATCGACCC GCGAGCTTAG AAACAGGATT TTTCCCACTC TGTATGCTAT ATTTCAACAG 
26821 AGCAGGGGCC AAGAACAAGA GCTGAAAATA AAAAACAGGT CTCTGCOATC CCTCACCCGC 
26861 AGCTGCCTGT ATCACAAAAG CGAAGATCAG CTTCGGCGCA CGCTGGAAGA CGCGGAGGCT 
26941 CTCTTCAGTA AATACTGCGC GCTGACTCTT AAGGACTAGT TTCGCGCCCT TTCTCAAATT 
27001 TAAGCGCGAA AACTACGTCA TCTCCAGCGG CCACACCCGG CGCCAGCACC TGTCGTCAGC 
27061 GCCATTATGA GCAAGGAAAT TCCCACGCCC TACATGTGGA OTTACCAGCC ACAAATGGGA 
27121 CTTGOGGCTG GAOCTGCCCA AGACTACTCA ACCCGAATAA ACTACATGA0 CGCGGGACCC 
27181 CACATGATAT CCCGGGTCAA CGGAATCCGC GCCCACCGAA ACCGAATTCT CTTGGAACAG 
27241 GCGGCTATTA CCACCACACC TCGTAATAAC CTTAATCCCC GTAGTTGGCC CGCTGCCCTG 
27301 GTGTACCAGG AAAGTCCCGC TCCCACCACT GTGGTACTTC CCAGAOACGC CCAGGCOGAA 
27361 GTTCAGATGA CTAACTCAGG GGCGCAGCTT GCGGGCGGCT TTCGTCACAO GGTGCGGTCG 
27421 CCCGGGCAGG GTATAACTCA CCTGACAATC AGAGGGCGAG OTATTCAGCT CAACGACGAG 
27481 TCGGTGAGCT CCTCGCTTGG TCTCCOTCCG GACGGGACAT TTCAGATCGG CGGOGCCGGC 
27541 CGTCCTTCAT TCACGCCTCG TCAGGCAATC CTAACTCTGC AGACCTCGTC CTCTGAGCCG 
27601 CGCTCTGGAG GCATTGGAAC TCTGCAATTT ATTGAGGAGT TTGTGCCATC GGTCTACTTT 
27661 AACCCCTTCT CGGGACCTCC CGGCCACTAT CCGGATCAAT TTATTCCTAA CTTTGACGCG 
27721 GTAAAGGACT CGGCGGACGG CTACGACTGA ATGTTAAGTG GAGAGGCAGA GCAACTGCGC 
27781 CTGAAACACC TGGTCCACTG TCGCCGCCAC AAGTGCTTTG CCCGCGACTC CGGTGAGTTT 
27841 TGCTACTTTG AATTGCCCGA GGATCATATC GAGGGCCCGG CGCACGGCGT CCGGCTTACC 
27901 GCCCAGGGAG AGCTTGCCCG TAGCCTGATT CGGGAGTTTA CCCAGCGCCC CCTGCTAGTT 
27961 GAGCGGGACA GGGGACCCTG TGTTCTCACT GTGATTTGCA ACTGTCCTAA CCTTGGATTA 
28021 CATCAAGATC TTTGTTGCCA TCTCTGTGCT GAGTATAATA AATACAGAAA TTAAAATATA 
28081 CTGGGGCTCC TATOGCCATC CTGTAAACGC CACCGTCTTC ACCCGCCCAA GCAAACCAAG 
28141 GCGAACCTTA CCTGGTACTT TTAACATCTC TCCCTCTGTG ATTTACAACA GTTTCAACCC 
28201 AGACGGAGTG AGTCTACGAG AGAACCTCTC CGAGCTCAGC TACTCCATCA GAAAAAACAC 
28261 CACCCTCCTT ACCTGCCGGG AACGTACGAG TGCGTCACCG GCCGCTGCAC CACACCTACC 
28321 GCCTGACCGT AAACCAGACT TTTTCCGGAC AGACCTCAAT AACTCTGTTT ACCAGAACAG 
28381 GAGGTGAGCT TAGAAAACCC TTAGGGTATT AGGCCAAAGG CGCAGCTACT GTGGGGTTTA 
28441 TGAACAATTC AAGCAACTCT ACGGGCTATT CTAATTCAGG TTTCTCTAGA AGTCAGGCTT 
28S01 CCTGGATGTC AGCATCTOAC TTTGGCCAGC ACCTGTCCCG CGGATTTGTT CCAGTCCAAC 
28561 TAGAGCGACC CACCCTAACA GAGATGACCA ACACAACCAA CGCGGCCGCC GCTACCGGAC 
28621 TTACATCTAC CACAAATACA CCCCAAGTTT CTGCCTTTGT CAATAACTGG GATAACTTGG 
28681 GCATGTGGTG GTTCTCCATA GCGCTTATGT TTGTATGCCT TATTATTATG TGGCTCATCT 
28741 GCTGCCTAAA GCGCAAACGC GCCCGACCAC CCATCTATAG TCCCATCATT GTGCTACACC 
28801 CAAACAATGA TGGAATCCAT AGATTGQACG GACTGAAACA CATGTTCTTT TCTCTTACAG 
28861 TATGATTAAA TGAGATCTAG AAATGGACGG AATTATTACA GAGCAGCGCC TGCTAGAAAG 
28921 ACGCAOGGCA GCGGCCOAGC AACAGCGCAT GAATCAAGAG CTCCAAGACA TGGTTAACTT 
28981 GCACCAGTGC AAAAGGGGTA TCTTTTGTCT GGTAAAGCAG GCCAAAGTCA CCTACGACAG 
29041 TAATACCACC GGACACCGCC TTAGCTACAA GTTGCCAACC AAGCGTCAGA AATTGGTGGT 
29101 CATGGTGGOA GAAAAGCCCA TTACCATAAC TCAGCACTCQ GTAGAAACCG AAGGCTGCAT 
29161 TCACTCACCT TGTCAAGGAC CTGAGGATCT CTGCACCCTT ATTAAGACCC TGTGCGGTCT 
29221 CAAAGATCTT ATTCCCTTTA ACTAATAAAA AAAAAXAATA AAGCATCACT TACTTAAAAT 
29281 CAGTTAGCAA ATTTCTGTCC AGTTTATTCA GCAGCACCTC CTTGCCCTCC TCCCAGCTCT 
29341 GGTATTGCAG C l 'lt X riCCT G GCTGCAAACT TTCTCCACAA TCTAAATGGA ATGTCAGTTT 
29401 CCTCCTGTTC CTOTCCATCC GCACCCACTA TCTTGATGTT GTTGCAOATG AAGCGCGCAA 
29461 GACCGTCTGA AGATACCTTC AACCCCGTGT ATCCATATGA CACGGAAACC GGTCCTCCAA 
29521 CTGTGCCTTT TCTTACTCCT CCCTTTGTAT CCCCCAATGG GTTTCAAGAG AGTCCCCCTG 
29581 GGGTACTCTC TTTGCGCCTA TCCGAACCTC TAOTTACCTC CAATGGCATG CTTGCGCTCA 
29641 AAATGGGCAA CGGCCTCTCT CTGOACGAGO CCGGCAACCT TACCTCCCAA AATGTAACCA 
29701 CTGTGAOCCC ACCTCTCAAA AAAACCAAGT CAAACATAAA CCTGGAAATA TCTGCACCCC 
29761 TCACAGTTAC CTCAGAAGCC CTAACTGTGG CTGCCGCOGC ACCTCTAATG GTCGCGGGCA 
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29821 ACACACTCAC CATGCAATCA CAGGCCCCGC TAACCGTGCA CGACTCCAAA CTTAGCATTG 
29881 CCACCCAAGG ACCCCTCACA GTGTCAGAAG GAAAGCTAGC CCTGCAAACA TCAGGCCCCC 
29941 TCACCACCAC CGATAGCAGT ACCCTTACTA TCACTGCCTC ACCCCCTCTA ACTACTGCCA 
30001 CTGGTAGCTT GGGCATTGAC TTGAAAGAGC CCATTTATAC ACAAAATGGA AAACTAGGAC 
30061 TAAAGTACGG GGCTCCTTTG CATGTAACAG ACGACCTAAA CACTTTGACC GTAGCAACTG 
30121 GTCCAGGTGT GACTATTAAT AATACTTCCT TGCAAACTAA AGTTACTGGA GCCTTGGGTT 
30181 TTGATTCACA AGGCAATATG CAACTTAATG TAGCAGGAGG ACTAAGGATT GATTCTCAAA 
30241 ACAGACGCCT TATACTTGAT GTTAGTTATC CGTTTGATGC TCAAAACCAA CTAAATCTAA 
30301 GACTAGGACA GGGCCCTCTT TTTATAAACT CAGCCCACAA CTTGGATATT AACTACAACA 
30361 AAGGCCTTTA CTTGTTTACA GCTTCAAACA ATTCCAAAAA GCTTGAGGTT AACCTAAGCA 
30421 CTGCCAAGGG GTTGATGTTT GACGCTACAG CCATAGCCAT TAATGCAGGA GATGGGCTTG 
30481 AATTTGGTTC ACCTAATGCA CCAAACACAA ATCCCCTCAA AACAAAAATT G GCCATG GCC 
30541 TAGAATTTGA TTCAAACAAG GCTATGGTTC CTAAACTAGG AACTG GCCT T AGTTTTGACA 
30601 GCACAGGTGC CATTACAGTA GGAAACAAAA ATAATGATAA GCTAACTTTG TGGACCACAC 
30661 CAGCTCCATC TCCTAACTGT AGACTAAATG CAGAGAAAGA TGCTAAACTC ACTTTGGTCT 
30721 TAACAAAATG TGGCAGTCAA ATACTTGCTA CAGTTTCAGT TTTGGCTGTT AAAGGCAGTT 
30781 TGGCTCCAAT ATCTGGAACA GTTCAAAGTG CTCATCTTAT TATAAGATTT GACGAAAATG 
30841 GAGTGCTACT AAACAATTCC TTCCTGGACC CAGAATATTG GAACTTTAGA AATG QAGA TC 
30901 TTACTGAAGG CACAGCCTAT ACAAACGCTG TTGGATTTAT GCCT AACCT A TCAGCTTATC 
30961 CAAAATCTCA CGGTAAAACT GCCAAAAGTA ACATTGTCAG TCAAGTTTAC TTAAACGGAG 
31021 ACAAAACTAA ACCTGTAACA CTAACCATTA CACTAAACGG TACACAGGAA ACAGGAGACA 
31081 CAACTCCAAG TGCATACTCT ATGTCATTTT CATGGGACTG GTCTGGCCAC AACTACATTA 
31141 ATGAAATATT TGCCACATCC TCTTACACTT TTTCATACAT TGCCCAAGAA TAAAGAATCQ 
31201 TTTGTGTTAT GTTTCAACGT GTTTATTTTT CAATTGCAGA AAATTTCAAG TCATTTTTCA 
31261 TTCAGTAGTA TAGCCCCACC ACCACATAGC TTATACAGAT CACCGTACCT TAATCAAACT 
31321 CACAGAACCC TAGTATTCAA CCTGCCACCT CCCTCCCAAC ACACAGAGTA CACAGTCCTT 
31381 TCTCCCCGGC TGGCCTTAAA AAGCATCATA TCATGGGTAA CAGACATATT CTTAGGTGTT 
31441 ATATTCCACA CGGTTTCCTG TCGAGCCAAA CGCTCATCAG TGATATTAAT AAACTCCCCG 
31501 GGCAGCTCAC TTAAGTTCAT GTCGCTGTCC AGCTGCTGAG CCACAGGCTG CTGTCCAACT 
31561 TGCGGTTGCT TAACGGGCGG CGAAGGAGAA GTCCACGCCT ACATGGGGGT AGAGTCATAA 
31621 TCGTGCATCA GGATAGGGCG GTGGTGCTGC AGCAGCGCGC GAATAAACTG CTGCCGCCGC 
31681 CGCTCCGTCC TGCAGGAATA CAACATGGCA GTGGTCTCCT CAGCGATGAT TCGCACCGCC 
31741 CGCAGCATAA GGCGCCTTGT CCTCCGGGCA CAGCAGCGCA CCCTGATCTC ACTTAAATCA 
31801 GCACAGTAAC TGCAGCACAG CACCACAATA TTGTTCAAAA TCCCACAGTG CAAGGCGCTG 
31861 TATCCAAAGC TCATGGCGGG GACCACAGAA CCCACGTGGC CATCATACCA CAAGCGCAGG 
31921 TAGATTAAGT GGCGACCCCT CATAAACACG CTGGACATAA ACATTACCTC TTTTGGCATG 
31981 TTGTAATTCA CCACCTCCCG GTACCATATA AACCTCTGAT TAAACATGGC GCCATCCACC 
32041 ACCATCCTAA ACCAGCTGGC CAAAACCTGC CCGCCGGCTA TACACTGCAG GGAACCGGGA 
32101 CTGGAACAAT GACAGTGGAG AGCCCAGOAC TCGTAACCAT GGATCATCAT GCTCGTCATG 
32161 ATATCAATGT TGGCACAACA CAGGCACACG TGCATACACT TCCTCAGGAT TACAAGCTCC 
32221 TCCCGCGTTA GAACCATATC CCAGGGAACA ACCCATTCCT GAATCAGCGT AAATCCCACA 
32281 CTGCAGGGAA GACCTCGCAC GTAACTCACG TTGTGCATTG TCAAAGTGTT ACATTCGGGC 
32341 AGCAGCGGAT GATCCTCCAG TATGGTAGCG CGGGTTTCTG TCTCAAAAGG AGGTAGACGA 
32401 TCCCTACTGT ACGOAGTGCO CCGAGACAAC CGAGATCGTG TTGGTCGTAG TGTCATGCCA 
32461 AATGGAACGC CGGACGTAGT CATATTTCCT GAAGCAAAAC CAGGTGCGGG CGTOACAAAC 
32521 AGATCTGCGT CTCCGGTCTC GCCGCTTAGA TCGCTCTGTG TAGTAGTTGT AGTATATCCA 
32581 CTCTCTCAAA GCATCCAGGC GCCCCCTGGC TTCGGGTTCT ATGTAAACTC CTTCATOCGC 
32641 CGCTGCCCTG ATAACATCCA CCACCGCAGA ATAAGCCACA CCCAGCCAAC CTACACATTC 
32701 GTTCTGCGAG TCACACACGG GAGGAGCGGG AAGAGCTGGA AGAACCATGT TTTTTTTTTT 
32761 ATTCCAAAAG ATTATCCAAA ACCTCAAAAT GAAGATCTAT TAAQ TGAACQ CGC TCCCC TC 
32821 CGGTGGCGTG GTCAAACTCT ACAGCCAAAG AACAGATAAT GGCATTTOTA AGATGTTGCA 
32881 CAATGGCTTC CAAAAGGCAA ACGGCCCTCA CGTCCAAGTG GACGTAAAGG CTAAACCCTT 
32941 CAGGGTGAAT CTCCTCTATA AACATTCCAG CACCTTCAAC CATGCCCAAA TAATTCTCAT 
33001 CTCGCCACCT TCTCAATATA TCTCTAAGCA AATCCCGAAT ATTAAGTCCG GCCATTGTAA 
33061 AAATCTGCTC CAGAGCGCCC TCCACCTTCA GCCTCAAGCA GCGAATCATG ATTGCAAAAA 
33121 TTCAGGTTCC TCACAOACCT GTATAAOATT CAAAAGCGGA ACATTAACAA AAATACCGCG 
33181 ATCCCGTAGG TCCCTTCGCA GGGCCAGCTG AACATAATCG TGCAGGTCTG CACGGACCAG 
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33241 CGCGGCCACT TCCCCGCCAG GAACCTTGAC AAAAGAACCC ACACTGATTA TGACACGCAT 
33301 ACTCGOAGCT ATGCTAACCA GCGTAOCCCC GATGTAAGCT TTGTTGCATG GGCGGCGATA 
33361 TAAAATGCAA G6TGCT6CTC AAAAAATCAG GCAAAGCCTC GCGCAAAAAA GAAAGCACAT 
33421 CGTA6TCATG CTCATGCAGA TAAAGGCAGG TAAGCTCCGG AACCACCACA GAAAAAGACA 
334B1 CCATTTTTCT CTCAAACATG TCTGCGGGTT TCTGCATAAA CACAAAATAA AATAACAAAA 
33541 AAACATTTAA ACATTAGAAG CCTGTCTTAC AACAGGAAAA ACAACCCTTA TAAGCATAAG 
33601 ACGOACTACO GCCATGCOGG CGTGACCGTA AAAAAACTGG TCACCGTGAT TAAAAAGCAC 
33661 CACCGACAGC TCCTCGGTCA TGTCCGGAGT CATAATGTAA GACTCGGTAA ACACATCAGG 
33721 TTGATTCATC GGTCAGTGCT AAAAAGCGAC CGAAATAGCC CGGGGGAATA CATACCCGCA 
33781 GGCGTAGAGA CAACATTACA GCCCCCATAG GAGGTATAAC AAAATTAATA GGAGAGAAAA 
33841 ACACATAAAC ACCTGAAAAA CCCTCCTGCC TAGGCAAAAT AGCACCCTCC CGCTCCAGAA 
33901 CAACATACAG CGCTTCACAG CGGCAGCCTA ACAGTCAGCC TTACCAGTAA AAAAGAAAAC 
33961 CTATTAAAAA AACACCACTC GACACGGCAC CAGCTCAATC AGTCACAGTG TAAAAAAGGG 
34021 CCAAGTGCAG AGCGAGTATA TATAGGACTA AAAAATGACG TAACGGTTAA AGTCCACAAA 
34081 AAACACCCAG AAAACCGCAC GCGAACCTAC GCCCAGAAAC OAAAGCCAAA AAACCCACAA 
34141 CTTCCTCAAA TCGTCACTTC CGTTTTCCCA CGTTACGTAA CTTCCCATTT TAAGAAAACT 
34201 ACAATTCCCA ACACATACAA GTTACTCCGC CCTAAAACCT ACGTCACCCG CCCCGTTCCC 
34261 ACGCCCCGCG CCACGTCACA AACTCCACCC CCTCATTATC ATATTGGCTT CAATCCAAAA 
34321 TAAGGTATAT TATTGATGAT G 
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E1A Functions Major Late Transcription Unit 

• Induce Ad genes. 

. Drive G, into S-phase. dWtrtC«tXX>«tX> 
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SEQUENCE LISTING 

<110> Wold, William S.M. 
Toth, Karoly 
Doronin, Konstantin 
Tollefson, Ann E. 

<120> Replication-Competent Anti-Cancer Vectors 

<130> 16153-5152 

<140> 
<141> 

<150> 09/351,778 
<151> 1999-07-12 

<160> 20 

<170> Patentln Ver. 2.0 

<210> 1 
<211> 33592 
<212> DNA 

<213> Adenovirus subgroup C 
<4 00> 1 

catcatcaat aatatacctt 
ttgtgacgtg gcgcggggcg 
gatgttgcaa gtgtggcgga 
gtgtgcgccg gtgtacacag 
taaatttggg cgtaaccgag 
agtgaaatct gaataatttt 
gactttgacc gtttacgtgg 
cgggtcaaag ttggcgtttt 
tgagttcctc aagaggccac 
tccgacaccg ggactgaaaa 
ccattttgaa ccacctaccc 
tcccaacgag gaggcggttt 
agggattgac ttactcactt 
ccggcagccc gagcagccgg 
tccacccagt gacgacgagg 
ccccgggcac ggttgcaggt 
tatgtgttcg ctttgctata 
atgggcagtg ggtgatagag 
gttttgtggt ttaaagaatt 
gagcctgagc ccgagccaga 
cctgctatcc tgagacgccc 
agctgtgact ccggtccttc 
cccattaaac cagttgccgt 
gacttgctta acgagcctgg 
ggtgtaaacc tgtgattgcg 
agtttaataa agggtgagat 
aaagggtata taatgcgccg 
gagtgtttgg aagatttttc 
tcttggtttt ggaggtttct 
gaggattaca agtgggaatt 
ttgaatctgg gtcaccaggc 
acaccggggc gcgctgcggc 
gaagaaaccc atctgagcgg 
gcggttgtga gacacaagaa 
ccgacggagg agcagcagca 
ccatggaacc cgagagccgg 
tgtatccaga actgagacgc 
taaagaggga gcggggggct 
taatgaccag acaccgtcct 



attttggatt gaagccaata tgataatgag ggggtggagt 60 
tgggaacggg gcgggtgacg tagtagtgtg gcggaagtgt 120 
acacatgtaa gcgacggatg tggcaaaagt gacgtttttg 180 
gaagtgacaa ttttcgcgcg gttttaggcg gatgttgtag 240 
taagatttgg ccattttcgc gggaaaactg aataagagga 300 
gtgttactca tagcgcgtaa tatttgtcta gggccgcggg 360 
agactcgccc aggtgttttt ctcaggtgtt ttccgcgttc 420 
attattatag tcagctgacg tgtagtgtat ttatacccgg 480 
tcttgagtgc cagcgagtag agttttctcc tccgagccgc 540 
tgagacatga ggtactggct gataatcttc cacctcctag 600 
ttcacgaact gtatgattta gacgtgacgg cccccgaaga 660 
cgcagatttt tcccgactct gtaatgttgg cggtgcagga 720 
ttccgccggc gcccggttct ccggagccgc ctcacctttc 780 
agcagagagc cttgggtccg gtttgccacg aggctggctt 840 
atgaagaggg tgaggagttt gtgttagatt atgtggagca 900 
cttgtcatta tcaccggagg aatacggggg acccagatat 960 
tgaggacctg tggcatgttt gtctacagta agtgaaaatt 1020 
tggtgggttt ggtgtggtaa tttttttttt aatttttaca 1080 
ttgtattgtg atttttttaa aaggtcctgt gtctgaacct 1140 
accggagcct gcaagaccta cccgccgtcc taaaatggcg 1200 
gacatcacct gtgtctagag aatgcaatag tagtacggat 1260 
taacacacct cctgagatac acccggtggt cccgctgtgc. 1320 
gagagttggt gggcgtcgcc aggctgtgga atgtatcgag 1380 
gcaacctttg gacttgagct gtaaacgccc caggccataa 1440 
tgtgtggtta acgcctttgt ttgctgaatg agttgatgta 1500 
aatgtttaac ttgcatggcg tgttaaatgg ggcggggctt 1560 
tgggctaatc ttggttacat ctgacctcat ggaggcttgg 1620 
tgctgtgcgt aacttgctgg aacagagctc taacagtacc 1680 
gtgggcfctca tcccaggcaa agttagtctg cagaattaag 1740 
tgaagagctt ttgaaatcct gtggtgagct gtttgattct 1800 
gcttttccaa gagaaggtca tcaagacttt ggatttttcc 1860 
tgctgttgct tttttgagtt ttataaagga taaatggagc 1920 
ggggtacctg ctggattttc tggccatgca tctgtggaga 1980 
tcgcctgcta ctgttgtctt ccgtccgccc ggcgataata 2040 
gcagcaggag gaagccaggc ggcggcggca ggagcagagc 2100 
cctggaccct cgggaatgaa tgttgtacag gtggctgaac 2160 
attttgacaa ttacagagga tgggcagggg ctaaaggggg 2220 
tgtgaggcta cagaggaggc taggaatcta gcttttagct 2280 
gagtgtatta cttttcaaca gatcaaggat aattgcgcta 2340 
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atgagcttga tctgctggcg cagaagtatt 
agccagggga tgattttgag gaggctatta 
attgcaagta caagatcagc aaacttgtaa 
acggggccga ggtggagata gatacggagg 
atatgtggcc gggggtgctt ggcatggacg 
gccccaattt tagcggtacg gttttcctgg 
gcttctatgg gtttaacaat acctgtgtgg 
gtgcctttta ctgctgctgg aagggggtgg 
agaaatgcct ctttgaaagg tgtaccttgg 
gccacaatgt ggcctccgac tgtggttgct 
agcataacat ggtatgtggc aactgcgagg 
acggcaactg tcacctgctg aagaccattc 
cagtgtttga gcataacata ctgacccgct 
tgttcctacc ttaccaatgc aatttgagtc 
tgtccaaggt gaacctgaac ggggtgtttg 
ggtacgatga gacccgcacc aggtgcagac 
accagcctgt gatgctggat gtgaccgagg 
gcacccgcgc tgagtttggc tctagcgatg 
ggcgtggctt aagggtggga aagaatatat 
gttttgcagc agccgccgcc gccatgagca 
catatttgac aacgcgcatg cccccatggg 
gcattgatgg tcgccccgtc ctgcccgcaa 
ctggaacgcc gttggagact gcagcctccg 
gcgggattgt gactgacttt gctttcctga 
catccgcccg cgatgacaag ttgacggctc 
aacttaatgt cgtttctcag cagctgttgg 
cttcctcccc tcccaatgcg gtttaaaaca 
ggatcaagca agtgtcttgc tgtctttatt 
accagcggtc tcggtcgttg agggtcctgt 
tctggatgtt cagatacatg ggcataagcc 
gagcttcatg ctgcggggtg gtgttgtaga 
ggtgcctaaa aatgtctttc agtagcaagc 
tgtttacaaa gcggttaagc tgggatgggt 
actgtatttt taggttggct atgttcccag 
gaaccaccag cacagtgtat ccggtgcact 
atgcgtggaa gaacttggag acgcccttgt 
taatgatggc aatgggccca cgggcggcgg 
cgtcatagtt gtgttccagg atgagatcgt 
gggtgccaga ctgcggtata atggttccat 
tttgcatttc ccacgctttg agttcagatg 
agaaaacggt ttccggggta ggggagatca 
gcgacttacc gcagccggtg ggcccgtaaa 
taagagagct gcagctgccg tcatccctga 
tgactcgcat gttttccctg accaaatccg 
gttcttgcaa ggaagcaaag tttttcaacg 
tgagcgtttg accaagcagt tccaggcggt 
ctcgatccag catatctcct cgtttcgcgg 
tcggtgctcg tccagacggg ccagggtcat 
cgtagtctgg gtcacggtga aggggtgcgc 
gaggctggtc ctgctggtgc tgaagcgctg 
gcatttgacc atggtgtcat agtccagccc 
gcccttggag gaggcgccgc acgaggggca 
cgcgagaaat accgattccg gggagtaggc 
gcattccacg agccaggtga gctctggccg 
ctttttgatg cgtttcttac ctctggtttc 
aaggctgtcc gtgtccccgt atacagactt 
gtcctcctcg tatagaaact cggaccactc 
gaaggaggct aagtgggagg ggtagcggtc 
ggtgtgaaga cacatgtcgc cctcttcggc 
ggccacgtga ccgggtgttc ctgaaggggg 
ctcactctct tccgcatcgc tgtctgcgag 
aaaagcgggc atgacttctg cgctaagatt 
attcacctgg cccgcggtga tgcctttgag 
aatctttttg ttgtcaagct tggtggcaaa 
ggcgatggag cgcagggttt ggtttttgtc 
tagctgcacg tattcgcgcg caacgcaccg 



ccatagagca gctgaccact tactggctgc 2400 
gggtatatgc aaaggtggca cttaggccag 2460 
atatcaggaa ttgttgctac atttctggga 2520 
atagggtggc ctttagatgt agcatgataa 2580 
gggtggttat tatgaatgta aggtttactg 2640 
ccaataccaa ccttatccta cacggtgtaa 2700 
aagcctggac cgatgtaagg gttcggggct 2760 
tgtgtcgccc caaaagcagg gcttcaatta 2820 
gtatcctgtc tgagggtaac tccagggtgc 2880 
tcatgctagt gaaaagcgtg gctgtgatta 2940 
acagggcctc tcagatgctg acctgctcgg 3000 
acgtagccag ccactctcgc aaggcctggc 3060 
gttccttgca tttgggtaac aggagggggg 3120 
acactaagat attgcttgag cccgagagca 3180 
acatgaccat gaagatctgg aaggtgctga 3240 
cctgcgagtg tggcggtaaa catattagga 3300 
agctgaggcc cgatcacttg gtgctggcct 3360 
aagatacaga ttgaggtact gaaatgtgtg 3420 
aaggtggggg tcttatgtag ttttgtatct 3480 
ccaactcgtt tgatggaagc attgtgagct 3540 
ccggggtgcg tcagaatgtg atgggctcca 3600 
actctactac cttgacctac gagaccgtgt 3660 
ccgccgcttc agccgctgca gccaccgccc 3720 
gcccgcttgc aagcagtgca gcttcccgtt 3780 
ttttggcaca attggattct ttgacccggg 3840 
atctgcgcca gcaggtttct gccctgaagg 3900 
taaataaaaa accagactct gtttggattt 3960 
taggggtttt gcgcgcgcgg taggcccggg 4020 
gtattttttc caggacgtgg taaaggtgac 4080 
cgtctctggg gtggaggtag caccactgca 4140 
tgatccagtc gtagcaggag cgctgggcgt 4200 
tgattgccag gggcaggccc ttggtgtaag 4260 
gcatacgtgg ggatatgaga tgcatcttgg 4320 
ccatatccct ccggggattc atgttgtgca 4380 
tgggaaattt gtcatgtagc ttagaaggaa 4440 
gacctccaag attttccatg cattcgtcca 4500 
cctgggcgaa gatatttctg ggatcactaa 4560 
cataggccat ttttacaaag cgcgggcgga 4620 
ccggcccagg ggcgtagtta ccctcacaga 4 680 
gggggatcat gtctacctgc ggggcgatga 4740 
gctgggaaga aagcaggttc ctgagcagct 4800 
tcacacctat taccgggtgc aactggtagt 4 860 
gcaggggggc cacttcgtta agcatgtccc 4920 
ccagaaggcg ctcgccgccc agcgatagca 4980 
gtttgagacc gtccgccgta ggcatgcttt 5040 
cccacagctc ggtcacctgc tctacggcat 5100 
gttggggcgg ctttcgctgt acggcagtag 5160 
gtctttccac gggcgcaggg tcctcgtcag 5220 
tccgggctgc gcgctggcca gggtgcgctt 5280 
ccggtcttcg ccctgcgcgt cggccaggta 5340 
ctccgcggcg tggcccttgg cgcgcagctt 5400 
gtgcagactt ttgagggcgt agagcttggg 5460 
atccgcgccg caggccccgc agacggtctc 5520 
ttcggggtca aaaaccaggt ttcccccatg 5580 
catgagccgg tgtccacgct cggtgacgaa 5640 
gagaggcctg tcctcgagcg gtgttccgcg 5700 
tgagacaaag gctcgcgtcc aggccagcac 5760 
gttgtccact agggggtcca ctcgctccag 5820 
atcaaggaag gtgattggtt tgtaggtgta 5880 
gctataaaag ggggtggggg cgcgttcgtc 5940 
ggccagctgt tggggtgagt actccctctg 6000 
gtcagtttcc aaaaacgagg aggatttgat 6060 
ggtggccgca tccatctggt cagaaaagac 6120 
cgacccgtag agggcgttgg acagcaactt 6180 
gcgatcggcg cgctccttgg ccgcgatgtt 6240 
ccattcggga aagacggtgg tgcgctcgtc 6300 
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gggcaccagg tgcacgcgcc aaccgcggtt gtgcagggtg acaaggtcaa cgctggtggc 6360 
tacctctccg cgtaggcgct cgttggtcca gcagaggcgg ccgcccttgc gcgagcagaa 6420 
tggcggtagg gggtctagct gcgtctcgtc cggggggtct gcgtccacgg taaagacccc 6480 
gggcagcagg cgcgcgtcga agtagtctat cttgcatcct tgcaagtcta gcgcctgctg 6540 
ccatgcgcgg gcggcaagcg cgcgctcgta tgggttgagt gggggacccc atggcatggg 6600 
gtgggtgagc gcggaggcgt acatgccgca aatgtcgtaa acgtagaggg gctctctgag 6660 
tattccaaga tatgtagggt agcatcttcc accgcggatg ctggcgcgca cgtaatcgta 6720 
tagttcgtgc gagggagcga ggaggtcggg accgaggttg ctacgggcgg gctgctctgc 6780 
tcggaagact atctgcctga agatggcatg tgagttggat gatatggttg gacgctggaa 6840 
gacgttgaag ctggcgtctg tgagacctac cgcgtcacgc acgaaggagg cgtaggagtc 6900 
gcgcagcttg ttgaccagct cggcggtgac ctgcacgtct agggcgcagt agtccagggt 6960 
ttccttgatg atgtcatact tatcctgtcc cttttttttc cacagctcgc ggttgaggac 7020 
aaactcttcg cggtctttcc agtactcttg gatcggaaac ccgtcggcct ccgaacggta 7080 
agagcctagc atgtagaact ggttgacggc ctggtaggcg cagcatccct tttctacggg 7140 
tagcgcgtat gcctgcgcgg ccttccggag cgaggtgtgg gtgagcgcaa aggtgtccct 7200 
gaccatgact ttgaggtact ggtatttgaa gtcagtgtcg tcgcatccgc cctgctccca 7260 
gagcaaaaag tccgtgcgct ttttggaacg cggatttggc agggcgaagg tgacatcgtt 7320 
gaagagtatc tttcccgcgc gaggcataaa gttgcgtgtg atgcggaagg gtcccggcac 7380 
ctcggaacgg ttgttaatta cctgggcggc gagcacgatc tcgtcaaagc cgttgatgtt 7440 
gtggcccaca atgtaaagtt ccaagaagcg cgggatgccc ttgatggaag gcaatttttt 7500 
aagttcctcg taggtgagct cttcagggga gctgagcccg tgctctgaaa gggcccagtc 7560 
tgcaagatga gggttggaag cgacgaatga gctccacagg tcacgggcca ttagcatttg 7620 
caggtggtcg cgaaaggtcc taaactggcg acctatggcc attttttctg gggtgatgca 7680 
gtagaaggta agcgggtctt gttcccagcg gtcccatcca aggttcgcgg ctaggtctcg 7740 
cgcggcagtc actagaggct catctccgcc gaacttcatg accagcatga agggcacgag 7800 
ctgcttccca aaggccccca tccaagtata ggtctctaca tcgtaggtga caaagagacg 7860 
ctcggtgcga ggatgcgagc cgatcgggaa gaactggatc tcccgccacc aattggagga 7920 
gtggctattg atgtggtgaa agtagaagtc cctgcgacgg gccgaacact cgtgctggct 7980 
tttgtaaaaa cgtgcgcagt actggcagcg gtgcacgggc tgtacatcct gcacgaggtt 8040 
gacctgacga ccgcgcacaa ggaagcagag tgggaatttg agcccctcgc ctggcgggtt 8100 
tggctggtgg tcttctactt cggctgcttg tccttgaccg tctggctgct cgaggggagt 8160 
tacggtggat cggaccacca cgccgcgcga gcccaaagtc cagatgtccg cgcgcggcgg 8220 
tcggagcttg atgacaacat cgcgcagatg ggagctgtcc atggtctgga gctcccgcgg 8280 
cgtcaggtca ggcgggagct cctgcaggtt tacctcgcat agacgggtca gggcgcgggc 8340 
tagatccagg tgatacctaa tttccagggg ctggttggtg gcggcgtcga tggcttgcaa 8400 
gaggccgcat ccccgcggcg cgactacggt accgcgcggc gggcggtggg ccgcgggggt 8460 
gtccttggat gatgcatcta aaagcggtga cgcgggcgag cccccggagg tagggggggc 8520 
tccggacccg ccgggagagg gggcaggggc acgtcggcgc cgcgcgcggg caggagctgg 8580 
tgctgcgcgc gtaggttgct ggcgaacgcg acgacgcggc ggttgatctc ctgaatctgg 8640 
cgcctctgcg tgaagacgac gggcccggtg agcttgagcc tgaaagagag ttcgacagaa 8700 
tcaatttcgg tgtcgttgac ggcggcctgg cgcaaaatct cctgcacgtc tcctgagttg 8760 
tcttgatagg cgatctcggc catgaactgc tcgatctctt cctcctggag atctccgcgt 8820 
ccggctcgct ccacggtggc ggcgaggtcg ttggaaatgc gggccatgag ctgcgagaag 8880 
gcgttgaggc ctccctcgtt ccagacgcgg ctgtagacca cgcccccttc ggcatcgcgg 8940 
gcgcgcatga ccacctgcgc gagattgagc tccacgtgcc gggcgaagac ggcgtagttt 9000 
cgcaggcgct gaaagaggta gttgagggtg gtggcggtgt gttctgccac gaagaagtac 9060 
ataacccagc gtcgcaacgt ggattcgttg atatccccca aggcctcaag gcgctccatg 9120 
gcctcgtaga agtccacggc gaagttgaaa aactgggagt tgcgcgccga cacggttaac 9180 
tcctcctcca gaagacggat gagctcggcg acagtgtcgc gcacctcgcg ctcaaaggct 9240 
acaggggcct cttcttcttc ttcaatctcc tcttccataa gggcctcccc ttcttcttct 9300 
tctggcggcg gtgggggagg ggggacacgg cggcgacgac ggcgcaccgg gaggcggtcg 9360 
acaaagcgct cgatcatctc cccgcggcga cggcgcatgg tctcggtgac ggcgcggccg 9420 
ttctcgcggg ggcgcagttg gaagacgccg cccgtcatgt cccggttatg ggttggcggg 94 80 
gggctgccat gcggcaggga tacggcgcta acgatgcatc tcaacaattg ttgtgtaggt 9540 
actccgccgc cgagggacct gagcgagtcc gcatcgaccg gatcggaaaa cctctcgaga 9600 
aaggcgtcta accagtcaca gtcgcaaggt aggctgagca ccgtggcggg cggcagcggg 9660 
cggcggtcgg ggttgtttct ggcggaggtg ctgctgatga tgtaattaaa gtaggcggtc 9720 
ttgagacggc ggatggtcga cagaagcacc atgtccttgg gtccggcctg ctgaatgcgc 9780 
aggcggtcgg ccatgcccca ggcttcgttt tgacatcggc gcaggtcttt gtagtagtct 9840 
tgcatgagcc tttctaccgg cacttcttct tctccttcct cttgtcctgc atctcttgca 9900 
tctatcgctg cggcggcggc ggagtttggc cgtaggtggc gccctcttcc tcccatgcgt 9960 
gtgaccccga agcccctcat cggctgaagc agggctaggt cggcgacaac gcgctcggct 10020 
aatatggcct gctgcacctg cgtgagggta gactggaagt catccatgtc cacaaagcgg 10080 
tggtatgcgc ccgtgttgat ggtgtaagtg cagttggcca taacggacca gttaacggtc 10140 
tggtgacccg gctgcgagag ctcggtgtac ctgagacgcg agtaagccct cgagtcaaat 10200 
acgtagtcgt tgcaagtccg caccaggtac tggtatccca ccaaaaagtg cggcggcggc 10260 
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tggcggtaga ggggccagcg tagggtggcc ggggctccgg gggcgagatc ttccaacata 10320 
aggcgatgat atccgtagat gtacctggac atccaggtga tgccggcggc ggtggtggag 10380 
gcgcgcggaa agtcgcggac gcggttccag atgttgcgca gcggcaaaaa gtgctccatg 10440 
gtcgggacgc tctggccggt caggcgcgcg caatcgttga cgctctagcg tgcaaaagga 10500 
gagcctgtaa gcgggcactc ttccgtggtc tggtggataa attcgcaagg gtatcatggc 10560 
ggacgaccgg ggttcgagcc ccgtatccgg ccgtccgccg tgatccatgc ggttaccgcc 10620 
cgcgtgtcga acccaggtgt gcgacgtcag acaacggggg agtgctcctt ttggcttcct 10660 
tccaggcgcg gcggctgctg cgctagcttt tttggccact ggccgcgcgc agcgtaagcg 10740 
gttaggctgg aaagcgaaag cattaagtgg ctcgctccct gtagccggag ggttattttc 10800 
caagggttga gtcgcgggac ccccggttcg agtctcggac cggccggact gcggcgaacg 10860 
ggggtttgcc tccccgtcat gcaagacccc gcttgcaaat tcctccggaa acagggacga 10920 
gccccttttt tgcttttccc agatgcatcc ggtgctgcgg cagatgcgcc cccctcctca 10980 
gcagcggcaa gagcaagagc agcggcagac atgcagggca ccctcccctc ctcctaccgc 11040 
gtcaggaggg gcgacatccg cggttgacgc ggcagcagat ggtgattacg aacccccgcg 11100 
gcgccgggcc cggcactacc tggacttgga ggagggcgag ggcctggcgc ggctaggagc 11160 
gccctctcct gagcggtacc caagggtgca gctgaagcgt gatacgcgtg aggcgtacgt 11220 
gccgcggcag aacctgtttc gcgaccgcga gggagaggag cccgaggaga tgcgggatcg 11280 
aaagttccac gcagggcgcg agctgcggca tggcctgaat cgcgagcggt tgctgcgcga 11340. 
ggaggacttt gagcccgacg cgcgaaccgg gattagtccc gcgcgcgcac acgtggcggc 11400 
cgccgacctg gtaaccgcat acgagcagac ggtgaaccag gagattaact ttcaaaaaag 11460 
ctttaacaac cacgtgcgta cgcttgtggc gcgcgaggag gtggctatag gactgatgca 11520 
tctgtgggac tttgtaagcg cgctggagca aaacccaaat agcaagccgc tcatggcgca 11580 
gctgttcctt atagtgcagc acagcaggga caacgaggca ttcagggatg cgctgctaaa 11640 
catagtagag cccgagggcc gctggctgct cgatttgata aacatcctgc agagcatagt 11700 
ggtgcaggag cgcagcttga gcctggctga caaggtggcc gccatcaact attccatgct 11760 
tagcctgggc aagttttacg cccgcaagat ataccatacc ccttacgttc ccatagacaa 11820 
ggaggtaaag atcgaggggt tctacatgcg catggcgctg aaggtgctta ccttgagcga 11880 
cgacctgggc gtttatcgca acgagcgcat ccacaaggcc gtgagcgtga gccggcggcg 11940 
cgagctcagc gaccgcgagc tgatgcacag cctgcaaagg gccctggctg gcacgggcag 12000 
cggcgataga gaggccgagt cctactttga cgcgggcgct gacctgcgct gggccccaag 12060 
ccgacgcgcc ctggaggcag ctggggccgg acctgggctg gcggtggcac ccgcgcgcgc 12120 
tggcaacgtc ggcggcgtgg aggaatatga cgaggacgat gagtacgagc cagaggacgg 12180 
cgagtactaa gcggtgatgt ttctgatcag atgatgcaag acgcaacgga cccggcggtg 12240 
cgggcggcgc tgcagagcca gccgtccggc cttaactcca cggacgactg gcgccaggtc 12300 
atggaccgca tcatgtcgct gactgcgcgc aatcctgacg cgttccggca gcagccgcag 12360 
gccaaccggc tctccgcaat tctggaagcg gtggtcccgg cgcgcgcaaa ccccacgcac 12420 
gagaaggtgc tggcgatcgt aaacgcgctg gccgaaaaca gggccatccg gcccgacgag 12480 
gccggcctgg tctacgacgc gctgcttcag cgcgtggctc gttacaacag cggcaacgtg 12540 
cagaccaacc tggaccggct ggtgggggat gtgcgcgagg ccgtggcgca gcgtgagcgc 12600 
gcgcagcagc agggcaacct gggctccatg gttgcactaa acgccttcct gagtacacag 12660 
cccgccaacg tgccgcgggg acaggaggac tacaccaact ttgtgagcgc actgcggcta 12720 
atggtgactg agacaccgca aagtgaggtg taccagtctg ggccagacta ttttttccag 12780 
accagtagac aaggcctgca gaccgtaaac ctgagccagg ctttcaaaaa cttgcagggg 12840 
ctgtgggggg tgcgggctcc cacaggcgac cgcgcgaccg tgtctagctt gctgacgccc 12900 
aactcgcgcc tgttgctgct gctaatagcg cccttcacgg acagtggcag cgtgtcccgg 12960 
gacacatacc taggtcactt gctgacactg taccgcgagg ccataggtca ggcgcatgtg 13020 
gacgagcata ctttccagga gattacaagt gtcagccgcg cgctggggca ggaggacacg 13080 
ggcagcctgg aggcaaccct aaactacctg ctgaccaacc ggcggcagaa gatcccctcg 13140 
ttgcacagtt taaacagcga ggaggagcgc attttgcgct acgtgcagca gagcgtgagc 13200 
cttaacctga tgcgcgacgg ggtaacgccc agcgtggcgc tggacatgac cgcgcgcaac 13260 
atggaaccgg gcatgtatgc ctcaaaccgg ccgtttatca accgcctaat ggactacttg 13320 
catcgcgcgg ccgccgtgaa ccccgagtat ttcaccaatg ccatcttgaa cccgcactgg 13380 
ctaccgcccc ctggtttcta caccggggga ttcgaggtgc ccgagggtaa cgatggattc 13440 
ctctgggacg acatagacga cagcgtgttt tccccgcaac cgcagaccct gctagagttg 13500 
caacagcgcg agcaggcaga ggcggcgctg cgaaaggaaa gcttccgcag gccaagcagc 13560 
ttgtccgatc taggcgctgc ggccccgcgg tcagatgcta gtagcccatt tccaagcttg 13620 
atagggtctc ttaccagcac tcgcaccacc cgcccgcgcc tgctgggcga ggaggagtac 13680 
ctaaacaact cgctgctgca gccgcagcgc gaaaaaaacc tgcctccggc atttcccaac 13740 
aacgggatag agagcctagt ggacaagatg agtagaftgga agacgtacgc gcaggagcac 13800 
agggacgtgc caggcccgcg cccgcccacc cgtcgtcaaa ggcacgaccg tcagcggggt 13860 
ctggtgtggg aggacgatga ctcggcagac gacagcagcg tcctggattt gggagggagt 13920 
ggcaacccgt. ttgcgcacct tcgccccagg ctggggagaa tgttttaaaa aaaaaaaagc 13980 
atgatgcaaa ataaaaaact caccaaggcc atggcaccga gcgttggttt tcttgtattc 14040 
cccttagtat gcggcgcgcg gcgatgtatg aggaaggtcc tcctccctcc tacgagagtg 14100 
tggtgagcgc ggcgccagtg gcggcggcgc tgggttctcc cttcgatgct cccctggacc 14160 
cgccgtttgt gcctccgcgg tacctgcggc ctaccggggg gagaaacagc atccgttact 14220 
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ctgagttggc acccctattc gacaccaccc gtgtgtacct ggtggacaac aagtcaacgg 14280 
atgtggcatc cctgaactac cagaacgacc acagcaactt tctgaccacg gtcattcaaa 14340 
acaatgacta cagcccgggg gaggcaagca cacagaccat caatcttgac gaccggtcgc 14 400 
actggggcgg cgacctgaaa accatcctgc ataccaacat gccaaatgtg aacgagttca 14460 
tgtttaccaa taagtttaag gcgcgggtga tggtgtcgcg cttgcctact aaggacaatc 14520 
aggtggagct gaaatacgag tgggtggagt tcacgctgcc cgagggcaac tactccgaga 14580 
ccatgaccat agaccttatg aacaacgcga tcgtggagca ctacttgaaa gtgggcagac 14640 
agaacggggt tctggaaagc gacatcgggg taaagtttga cacccgcaac ttcagactgg 14700 
ggtttgaccc cgtcactggt cttgtcatgc ctggggtata tacaaacgaa gccttccatc 14760 
cagacatcat tttgctgcca ggatgcgggg tggacttcac ccacagccgc ctgagcaact 14820 
tgttgggcat ccgcaagcgg caacccttcc aggagggctt taggatcacc tacgatgatc 14880 
tggagggtgg taacattccc gcactgttgg atgtggacgc ctaccaggcg agcttgaaag 14940 
atgacaccga acagggcggg ggtggcgcag gcggcagcaa cagcagtggc agcggcgcgg 15000 
aagagaactc caacgcggca gccgcggcaa tgcagccggt ggaggacatg aacgatcatg 15060 
ccattcgcgg cgacaccttt gccacacggg ctgaggagaa gcgcgctgag gccgaagcag 15120 
cggccgaagc tgccgccccc gctgcgcaac ccgaggtcga gaagcctcag aagaaaccgg 15180 
tgatcaaacc cctgacagag gacagcaaga aacgcagtta caacctaata agcaatgaca 15240 
gcaccttcac ccagtaccgc agctggtacc ttgcatacaa ctacggcgac cctcagaccg 15300 
gaatccgctc atggaccctg ctttgcactc ctgacgtaac ctgcggctcg gagcaggtct 15360 
actggtcgtt gccagacatg atgcaagacc ccgtgacctt ccgctccacg cgccagatca 15420 
gcaactttcc ggtggtgggc gccgagctgt tgcccgtgca ctccaagagc ttctacaacg 15480 
accaggccgt ctactcccaa ctcatccgcc agtttacctc tctgacccac gtgttcaatc 15540 
gctttcccga gaaccagatt ttggcgcgcc cgccagcccc caccatcacc accgtcagtg 15600 
aaaacgttcc tgctctcaca gatcacggga cgctaccgct gcgcaacagc atcggaggag 15660 
tccagcgagt gaccattact gacgccagac gccgcacctg cccctacgtt tacaaggccc 15720 
tgggcatagt ctcgccgcgc gtcctatcga gccgcacttt ttgagcaagc atgtccatcc 15780 
ttatatcgcc cagcaataac acaggctggg gcctgcgctt cccaagcaag atgtttggcg 15840 
gggccaagaa gcgctccgac caacacccag tgcgcgtgcg cgggcactac cgcgcgccct 15900 
ggggcgcgca caaacgcggc cgcactgggc gcaccaccgt cgatgacgcc atcgacgcgg 15960 
tggtggagga ggcgcgcaac tacacgccca cgccgccacc agtgtccaca gtggacgcgg 16020 
ccattcagac cgtggtgcgc ggagcccggc gctatgctaa aatgaagaga cggcggaggc 16080 
gcgtagcacg tcgccaccgc cgccgacccg gcactgccgc ccaacgcgcg gcggcggccc 16140 
tgcttaaccg cgcacgtcgc accggccgac gggcggccat gcgggccgct cgaaggctgg 16200 
ccgcgggtat tgtcactgtg ccccccaggt ccaggcgacg agcggccgcc gcagcagccg 16260 
cggccattag tgctatgact cagggtcgca ggggcaacgt gtattgggtg cgcgactcgg 16320 
ttagcggcct gcgcgtgccc gtgcgcaccc gccccccgcg caactagatt gcaagaaaaa 16380 
actacttaga ctcgtactgt tgtatgtatc cagcggcggc ggcgcgcaac gaagctatgt 16440 
ccaagcgcaa aatcaaagaa gagatgctcc aggtcatcgc gccggagatc tatggccccc 16500 
cgaagaagga agagcaggat tacaagcccc gaaagctaaa gcgggtcaaa aagaaaaaga 16560 
aagatgatga tgatgaactt gacgacgagg tggaactgct gcacgctacc gcgcccaggc 16620 
gacgggtaca gtggaaaggt cgacgcgtaa aacgtgtttt gcgacccggc accaccgtag 16680 
tctttacgcc cggtgagcgc tccacccgca cctacaagcg cgtgtatgat gaggtgtacg 16740 
gcgacgagga cctgcttgag caggccaacg agcgcctcgg ggagtttgcc tacggaaagc 16800 
ggcataagga catgctggcg ttgccgctgg acgagggcaa cccaacacct agcctaaagc 16860 
ccgtaacact gcagcaggtg ctgcccgcgc ttgcaccgtc cgaagaaaag cgcggcctaa 16920 
agcgcgagtc tggtgacttg gcacccaccg tgcagctgat ggtacccaag cgccagcgac 16980 
tggaagatgt cttggaaaaa atgaccgtgg aacctgggct ggagcccgag gtccgcgtgc 17040 
ggccaatcaa gcaggtggcg ccgggactgg gcgtgcagac cgtggacgtt cagataccca 17100 
ctaccagtag caccagtatt gccaccgcca cagagggcat ggagacacaa acgtccccgg 17160 
ttgcctcagc ggtggcggat gccgcggtgc aggcggtcgc tgcggccgcg tccaagacct 17220 
ctacggaggt gcaaacggac ccgtggatgt ttcgcgtttc agccccccgg cgcccgcgcg 17280 
gttcgaggaa gtacggcgcc gccagcgcgc tactgcccga atatgcccta catccttcca 17340 
ttgcgcctac ccccggctat cgtggctaca cctaccgccc cagaagacga gcaactaccc 17400 
gacgccgaac caccactgga acccgccgcc gccgtcgccg tcgccagccc gtgctggccc 17460 
cgatttccgt gcgcagggtg gctcgcgaag gaggcaggac cctggtgctg ccaacagcgc 17520 
gctaccaccc cagcatcgtt taaaagccgg tctttgtggt tcttgcagat atggccctca 17580 
cctgccgcct ccgtttcccg gtgccgggat tccgaggaag aatgcaccgt aggaggggca 17640 
tggccggcca cggcctgacg. ggcggcatgc gtcgtgcgca ccaccggcgg cggcgcgcgt 17700 
cgcaccgtcg catgcgcggc ggtatcctgc ccctccttat tccactgatc gccgcggcga 17760 
ttggcgccgt gcccggaatt gcatccgtgg ccttgcaggc gcagagacac tgattaaaaa 17820 
caagttgcat gtggaaaaat caaaataaaa agtctggact ctcacgctcg cttggtcctg 17680 
taactatttt gtagaatgga agacatcaac tttgcgtctc tggccccgcg acacggctcg 17940 
cgcccgttca tgggaaactg gcaagatatc ggcaccagca atatgagcgg tggcgccttc 18000 
agctggggct cgctgtggag cggcattaaa aatttcggtt ccaccgttaa gaactatggc 18060 
agcaaggcct ggaacagcag cacaggccag atgctgaggg ataagttgaa agagcaaaat 18120 
ttccaacaaa aggtggtaga tggcctggcc tctggcatta gcggggtggt ggacctggcc 18180 
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aaccaggcag tgcaaaataa gattaacagt aagcttgatc cccgccctcc cgtagaggag 18240 
cctccaccgg ccgtggagac agtgtctcca gaggggcgtg gcgaaaagcg tccgcgcccc 18300 
gacagggaag aaactctggt gacgcaaata gacgagcctc cctcgtacga ggaggcacta 18360 
aagcaaggcc tgcccaccac ccgtcccatc gcgcccatgg ctaccggagt gctgggccag 18420 
cacacacccg taacgctgga cctgcctccc cccgccgaca cccagcagaa acctgtgctg 18480 
ccaggcccga ccgccgttgt tgtaacccgt cctagccgcg cgtccctgcg ccgcgccgcc 18540 
agcggtccgc gatcgttgcg gcccgtagcc agtggcaact ggcaaagcac actgaacagc 18600 
atcgtgggtc tgggggtgca atccctgaag cgccgacgat gcttctgaat agctaacgtg 18660 
tcgtatgtgt gtcatgtatg cgtccatgtc gccgccagag gagctgctga gccgccgcgc 18720 
gcccgctttc caagatggct accccttcga tgatgccgca gtggtcttac atgcacatct 18780 
cgggccagga cgcctcggag tacctgagcc ccgggctggt gcagtttgcc cgcgccaccg 18840 
agacgtactt cagcctgaat aacaagttta gaaaccccac ggtggcgcct acgcacgacg 18900 
tgaccacaga ccggtcccag cgtttgacgc tgcggttcat ccctgtggac cgtgaggata 18960 
ctgcgtactc gtacaaggcg cggttcaccc tagctgtggg tgataaccgt gtgctggaca 19020 
tggcttccac gtactttgac atccgcggcg tgctggacag gggccctact tttaagccct 19080 
actctggcac tgcctacaac gccctggctc ccaagggtgc cccaaatcct tgcgaatggg 19140 
atgaagctgc tactgctctt gaaataaacc tagaagaaga ggacgatgac aacgaagacg 19200 
aagtagacga gcaagctgag cagcaaaaaa ctcacgtatt tgggcaggcg ccttattctg 19260 
gtataaatat tacaaaggag ggtattcaaa taggtgtcga aggtcaaaca cctaaatatg 19320 
ccgataaaac atttcaacct gaacctcaaa. taggagaatc tcagtggtac gaaactgaaa 19380 
ttaatcatgc agctgggaga gtccttaaaa agactacccc aatgaaacca tgttacggtt 19440 
catatgcaaa acccacaaat gaaaatggag ggcaaggcat tcttgtaaag caacaaaatg 19500 
gaaagctaga aagtcaagtg gaaatgcaat ttttctcaac tactgaggcg accgcaggca 19560 
atggtgataa cttgactcct aaagtggtat tgtacagtga agatgtagat atagaaaccc 19620 
cagacactca tatttcttac atgcccacta ttaaggaagg taactcacga gaactaatgg 19680 
gccaacaatc tatgcccaac aggcctaatt acattgcttt tagggacaat tttattggtc 19740 
taatgtatta caacagcacg ggtaatatgg gtgttctggc gggccaagca- tcgcagttga 19800 
atgctgttgt agatttgcaa gacagaaaca cagagctttc ataccagctt ttgcttgatt 19860 
ccattggtga tagaaccagg tacttttcta tgtggaatca ggctgttgac agctatgatc 19920 
cagatgttag aattattgaa aatcatggaa ctgaagatga acttccaaat tactgctttc 19980 
cactgggagg tgtgattaat acagagactc ttaccaaggt aaaacctaaa acaggtcagg 20040 
aaaatggatg ggaaaaagat gctacagaat tttcagataa aaatgaaata agagttggaa 20100 
ataattttgc catggaaatc aatctaaatg ccaacctgtg gagaaatttc ctgtactcca 20160 
acatagcgct gtatttgccc gacaagctaa agtacagtcc ttccaacgta aaaatttctg 20220 
ataacccaaa cacctacgac tacatgaaca agcgagtggt ggctcccggg ttagtggact 20280 
gctacattaa ccttggagca cgctggtccc ttgactatat ggacaacgtc aacccattta 20340 
accaccaccg caatgctggc ctgcgctacc gctcaatgtt gctgggcaat ggtcgctatg 20400 
tgcccttcca catccaggtg cctcagaagt tctttgccat taaaaacctc cttctcctgc 20460 
cgggctcata cacctacgag tggaacttca ggaaggatgt taacatggtt ctgcagagct 20520 
ccctaggaaa tgacctaagg gttgacggag ccagcattaa gtttgatagc atttgccttt 20580 
acgccacctt cttccccatg gcccacaaca ccgcctccac gcttgaggcc atgcttagaa 20640 
acgacaccaa cgaccagtcc tttaacgact atctctccgc cgccaacatg ctctacccta 20700 
tacccgccaa cgctaccaac gtgcccatat ccatcccctc ccgcaactgg gcggctttcc 20760 
gcggctgggc cttcacgcgc cttaagacta aggaaacccc atcactgggc tcgggctacg 20820 
acccttatta cacctactct ggctctatac cctacctaga tggaaccttt tacctcaacc 20880 
acacctttaa gaaggtggcc attacctttg actcttctgt cagctggcct ggcaatgacc 20940 
gcctgcttac ccccaacgag tttgaaatta agcgctcagt tgacggggag ggttacaacg 21000 
ttgcccagtg taacatgacc aaagactggt tcctggtaca aatgctagct aactacaaca 21060 
ttggctacca gggcttctat atcccagaga gctacaagga ccgcatgtac tccttcttta 21120 
gaaacttcca gcccatgagc cgtcaggtgg tggatgatac taaatacaag gactaccaac 21180 
aggtgggcat cctacaccaa cacaacaact ctggatttgt tggctacctt gcccccacca 21240 
tgcgcgaagg acaggcctac cctgctaact tcccctatcc gcttataggc aagaccgcag 21300 
ttgacagcat tacccagaaa aagtttcttt gcgatcgcac cctttggcgc atcccattct 21360 
ccagtaactt tatgtccatg ggcgcactca cagacctggg ccaaaacctt ctctacgcca 21420 
actccgccca cgcgctagac atgacttttg aggtggatcc catggacgag cccacccttc 21480 
tttatgtttt gtttgaagtc tttgacgtgg tccgtgtgca ccggccgcac cgcggcgtca 21540 
tcgaaaccgt gtacctgcgc acgcccttct cggccggcaa cgccacaaca taaagaagca 21600 
agcaacatca acaacagctg ccgccatggg ctccagtgag caggaactga aagccattgt 21660 
caaagatctt ggttgtgggc catatttttt gggcacctat gacaagcgct ttccaggctt 21720 
tgtttctcca cacaagctcg cctgcgccat agtcaatacg gccggtcgcg agactggggg 21780 
cgtacactgg atggcctttg cctggaaccc gcactcaaaa acatgctacc tctttgagcc 21840 
ctttggcttt tctgaccagc gactcaagca ggtttaccag tttgagtacg agtcactcct 21900 
gcgccgtagc gccattgctt cttcccccga ccgctgtata acgctggaaa agtccaccca 21960 
aagcgtacag gggcccaact cggccgcctg tggactattc tgctgcatgt ttctccacgc 22020 
ctttgccaac tggccccaaa ctcccatgga tcacaacccc accatgaacc ttattaccgg 22080 
ggtacccaac tccatgctca acagtcccca ggtacagccc accctgcgtc gcaaccagga 22140 
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acagctctac agcttcctgg agcgccactc gccctacttc cgcagccaca gtgcgcagat 22200 
taggagcgcc acttcttttt gtcacttgaa aaacatgtaa aaataatgta ctagagacac 22260 
tttcaataaa ggcaaatgct tttatttgta cactctcggg tgattattta cccccaccct 22320 
tgccgtctgc gccgtttaaa aatcaaaggg gttctgccgc gcatcgctat gcgccactgg 22380 
cagggacacg ttgcgatact ggtgtttagt gctccactta aactcaggca caaccatccg 22440 
cggcagctcg gtgaagtttt cactccacag gctgcgcacc atcaccaacg cgtttagcag 22500 
gtcgggcgcc gatatcttga agtcgcagtt ggggcctccg ccctgcgcgc gcgagttgcg 22560 
atacacaggg ttgcagcact ggaacactat cagcgccggg tggtgcacgc tggccagcac 22620 
gctcttgtcg gagatcagat ccgcgtccag gtcctccgcg ttgctcaggg cgaacggagt 22680 
caactttggt agctgccttc ccaaaaaggg cgcgtgccca ggctttgagt tgcactcgca 22740 
ccgtagtggc atcaaaaggt gaccgtgccc ggtctgggcg ttaggataca gcgcctgcat 22800 
aaaagccttg atctgcttaa aagccacctg agcctttgcg ccttcagaga agaacatgcc 22860 
gcaagacttg ccggaaaact gattggccgg acaggccgcg tcgtgcacgc agcaccttgc 22920 
gtcggtgttg gagatctgca ccacatttcg gccccaccgg ttcttcacga tcttggcctt 22980 
gctagactgc tccttcagcg cgcgctgccc gttttcgctc gtcacatcca tttcaatcac 23040 
gtgctcctta tttatcataa tgcttccgtg tagacactta agctcgc.ctt cgatctcagc 23100 
gcagcggtgc agccacaacg cgcagcccgt gggctcgtga tgcttgtagg tcacctctgc 23160 
aaacgactgc aggtacgcct gcaggaatcg ccccatcatc gtcacaaagg tcttgttgct 23220 
ggtgaaggtc agctgcaacc cgcggtgctc ctcgttcagc caggtcttgc atacggccgc 23280 
cagagcttcc acttggtcag gcagtagttt gaagttcgcc tttagatcgt tatccacgtg 23340 
gtacttgtcc atcagcgcgc gcgcagcctc catgcccttc tcccacgcag acacgatcgg 23400 
cacactcagc gggttcatca ccgtaatttc actttccgct tcgctgggct cttcctcttc 23460 
ctcttgcgtc cgcataccac gcgccactgg gtcgtcttca ttcagccgcc gcactgtgcg 23520 
cttacctcct ttgccatgct tgattagcac cggtgggttg ctgaaaccca ccatttgtag 23580 
cgccacatct tctctttctt cctcgctgtc cacgattacc tctggtgatg gcgggcgctc 23640 
gggcttggga gaagggcgct tctttttctt cttgggcgca atggccaaat ccgccgccga 23700 
ggtcgatggc cgcgggctgg gtgtgcgcgg caccagcgcg tcttgtgatg agtcttcctc 23760 
gtcctcggac tcgatacgcc gcctcatccg cttttttggg ggcgcccggg gaggcggcgg 23820 
cgacggggac ggggacgaca cgtcctccat ggttggggga cgtcgcgccg caccgcgtcc 23880 
gcgctcgggg gtggtttcgc gctgctcctc ttcccgactg gccatttcct tctcctatag 23940 
gcagaaaaag atcatggagt cagtcgagaa gaaggacagc ctaaccgccc cctctgagtt 24000 
cgccaccacc gcctccaccg atgccgccaa cgcgcctacc accttccccg tcgaggcacc 24060 
cccgcttgag gaggaggaag tgattatcga gcaggaccca ggttttgtaa gcgaagacga 24120 
cgaggaccgc tcagtaccaa cagaggataa aaagcaagac caggacaacg cagaggcaaa 24180 
cgaggaacaa gtcgggcggg gggacgaaag gcatggcgac tacctagatg tgggagacga 24240 
cgtgctgttg aagcatctgc agcgccagtg cgccattatc tgcgacgcgt tgcaagagcg 24300 
cagcgatgtg cccctcgcca tagcggatgt cagccttgcc tacgaacgcc acctattctc 24360 
accgcgcgta ccccccaaac gccaagaaaa cggcacatgc gagcccaacc cgcgcctcaa 24420 
cttctacccc gtatttgccg tgccagaggt gcttgccacc tatcacatct ttttccaaaa 24480 
ctgcaagata cccctatcct gccgtgccaa ccgcagccga gcggacaagc agctggcctt 24540 
gcggcagggc gctgtcatac ctgatatcgc ctcgctcaac gaagtgccaa aaatctttga 24600 
gggtcttgga cgcgacgaga agcgcgcggc aaacgctctg caacaggaaa acagcgaaaa 24660 
tgaaagtcac tctggagtgt tggtggaact cgagggtgac aacgcgcgcc tagccgtact 24720 
aaaacgcagc atcgaggtca cccactttgc ctacccggca cttaacctac cccccaaggt 24780 
catgagcaca gtcatgagtg agctgatcgt gcgccgtgcg cagcccctgg agagggatgc 24840 
aaatttgcaa gaacaaacag aggagggcct acccgcagtt ggcgacgagc agctagcgcg 24900 
ctggcttcaa acgcgcgagc ctgccgactt ggaggagcga cgcaaactaa tgatggccgc 24960 
agtgctcgtt accgtggagc ttgagtgcat gcagcggttc tttgctgacc cggagatgca 25020 
gcgcaagcta gaggaaacat tgcactacac ctttcgacag ggctacgtac gccaggcctg 25080 
caagatctcc aacgtggagc tctgcaacct ggtctcctac cttggaattt tgcacgaaaa 25140 
ccgccttggg caaaacgtgc ttcattccac gctcaagggc gaggcgcgcc gcgactacgt 25200 
ccgcgactgc gtttacttat ttctatgcta cacctggcag acggccatgg gcgtttggca 25260 
gcagtgcttg gaggagtgca acctcaagga gctgcagaaa ctgctaaagc aaaacttgaa 25320 
ggacctatgg acggccttca acgagcgctc cgtggccgcg cacctggcgg acatcatttt 25380 
ccccgaacgc ctgcttaaaa ccctgcaaca gggtctgcca gacttcacca gtcaaagcat 25440 
gttgcagaac tttaggaact ttatcctaga gcgctcagga atcttgcccg ccacctgctg 25500 
tgcacttcct agcgactttg tgcccattaa gtaccgcgaa tgccctccgc cgctttgggg 25560 
ccactgctac cttctgcagc tagccaacta ccttgcctac cactctgaca taatggaaga 25620 
cgtgagcggt gacggtctac tggagtgtca ctgtcgctgc aacctatgca ccccgcaccg 25680 
ctccctggtt tgcaattcgc agctgcttaa cgaaagtcaa attatcggta cctttgagct 25740 
gcagggtccc tcgcctgacg aaaagtccgc ggctccgggg ttgaaactca ctccggggct 25800 
gtggacgtcg gcttaccttc gcaaatttgt acctgaggac taccacgccc acgagattag 25860 
gttctacgaa gaccaatccc gcccgccaaa tgcggagctt accgcctgcg tcattaccca 25920 
gggccacatt cttggccaat tgcaagccat caacaaagcc cgccaagagt ttctgctacg 25980 
aaagggacgg ggggtttact tggaccccca gtccggcgag gagctcaacc caatcccccc 26040 
gccgccgcag ccctatcagc agcagccgcg ggcccttgct tcccaggatg gcacccaaaa 26100 
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agaagctgca gctgccgccg ccacccacgg acgaggagga atactgggac agtcaggcag 26160 

aggaggtttt ggacgaggag gaggaggaca tgatggaaga ctgggagagc ctagacgagg 26220 

aagcttccga ggtcgaagag gtgtcagacg aaacaccgtc accctcggtc gcattcccct 26280 

C g CC ggcgcc ccagaaatcg gcaaccggtt ccagcatggc tacaacctcc gctcctcagg 26340 

cgccgccggc actgcccgtt cgccgaccca accgtagatg ggacaccact ggaaccaggg 26400 

ccggtaagtc caagcagccg ccgccgttag cccaagagca acaacagcgc caaggctacc 26460 

gctcatggcg cgggcacaag aacgccatag ttgcttgctt gcaagactgt gggggcaaca 26520 

tctccttcgc ccgccgcttt cttctctacc atcacggcgt ggccttcccc cgtaacatcc 26580 

tgcattacta ccgtcatctc tacagcccat actgcaccgg cggcagcggc agcggcagca 26640 

acagcagcgg ccacacagaa gcaaaggcga ccggatagca agactctgac aaagcccaag 26700 

aaatccacag cggcggcagc agcaggagga ggagcgctgc gtctggcgcc caacgaaccc 26760 

gtatcgaccc gcgagcttag aaacaggatt tttcccactc tgtatgctat atttcaacag 26820 

agcaggggcc aagaacaaga gctgaaaata aaaaacaggt ctctgcgatc cctcacccgc 26880 

agctgcctgt atcacaaaag cgaagatcag cttcggcgca cgctggaaga cgcggaggct 26940 

ctcttcagta aatactgcgc gctgactctt aaggactagt ttcgcgccct ttctcaaatt 27000 

taagcgcgaa aactacgtca tctccagcgg ccacacccgg cgccagcacc tgtcgtcagc 27060 

gccattatga gcaaggaaat tcccacgccc tacatgtgga gttaccagcc acaaatggga 27120 

cttgcggctg gagctgccca agactactca acccgaataa actacatgag cgcgggaccc 27180 

cacatgatat cccgggtcaa cggaatccgc gcccaccgaa accgaattct cttggaacag 27240 

gcggctatta ccaccacacc tcgtaataac cttaatcccc gtagttggcc cgctgccctg 27300 

gtgtaccagg aaagtcccgc tcccaccact gtggtacttc ccagagacgc ccaggccgaa 27360 

gttcagatga ctaactcagg ggcgcagctt gcgggcggct ttcgtcacag ggtgcggtcg 27420 

cccgggcagg gtataactca cctgacaatc agagggcgag gtattcagct caacgacgag 27480 

tcggtgagct cctcgcttgg tctccgtccg gacgggacat ttcagatcgg cggcgccggc 27540 

cgtccttcat tcacgcctcg tcaggcaatc ctaactctgc agacctcgtc ctctgagccg 27 600 

cgctctggag gcattggaac tctgcaattt attgaggagt ttgtgccatc ggtctacttt 27 660 

aaccccttct cgggacctcc cggccactat ccggatcaat ttattcctaa ctttgacgcg 27720 

gtaaaggact cggcggacgg ctacgactga taattaagtg gagaggcaga gcaactgcgc 27780 

ctgaaacacc tggtccactg tcgccgccac aagtgctttg cccgcgactc cggtgagttt 27840 

tgctactttg aattgcccga ggatcatatc gaggatcttt gttgccatct ctgtgctgag 27900 

tataataaat acagaaatta aaatatactg gggctcctat cgccatcctg taaacgccac 27960 

cgtcttcacc cgcccaagca aaccaaggcg aaccttacct ggtactttta acatctctcc 28020 

ctctgtgatt tacaacagtt tcaacccaga cggagtgagt ctacgagaga acctctccga 28080 

gctcagctac tccatcagaa aaaacaccac cctccttacc tgccgggaac gtacccttaa 28140 

ttaaaagtca ggcttcctgg atgtcagcat ctgactttgg ccagcacctg tcccgcggat 28200 

ttgttccagt ccaactacag cgacccaccc taacagagat gaccaacaca accaacgcgg 28260 

ccgccgctac cggacttaca tctaccacaa atacacccca agtttctgcc tttgtcaata 28320 

actgggataa cttgggcatg tggtggttct ccatagcgct tatgtttgta tgccttatta 28380 

ttatgtggct catctgctgc ctaaagcgca aacgcgcccg accacccatc tatagtccca 28440 

tcattgtgct acacccaaac aatgatggaa tccatagatt ggacggactg aaacacatgt 28500 

tcttttctct tacagtatga ttaaatgaga ttaattaagg aatttctgtc cagtttattc 28560 

agcagcacct ccttgccctc ctcccagctc tggtattgca gcttcctcct ggctgcaaac 28620 

tttctccaca atctaaatgg aatgtcagtt tcctcctgtt cctgtccatc cgcacccact 28680 

atcttcatgt tgttgcagat gaagcgcgca agaccgtctg aagatacctt caaccccgtg 28740 

tatccatatg acacggaaac cggtcctcca actgtgcctt ttcttactcc tccctttgta 28800 

tcccccaatg ggtttcaaga gagtccccct ggggtactct ctttgcgcct atccgaacct 28860 

ctagttacct ccaatggcat gcttgcgctc aaaatgggca acggcctctc tctggacgag 28920 

gccggcaacc ttacctccca aaatgtaacc actgtgagcc cacctctcaa aaaaaccaag 28980 

tcaaacataa acctggaaat atctgcaccc ctcacagtta cctcagaagc cctaactgtg 29040 

gctgccgccg cacctctaat ggtcgcgggc aacacactca ccatgcaatc acaggccccg 29100 

ctaaccgtgc acgactccaa acttagcatt gccacccaag gacccctcac agtgtcagaa 29160 

ggaaagctag ccctgcaaac atcaggcccc ctcaccacca ccgatagcag tacccttact 29220 

atcactgcct caccccctct aactactgcc actggtagct tgggcattga cttgaaagag 29280 

cccatttata cacaaaatgg aaaactagga ctaaagtacg gggctccttt gcatgtaaca 29340 

gacgacctaa acactttgac cgtagcaact ggtccaggtg tgactattaa taatacttcc 29400 

ttgcaaacta aagttactgg agccttgggt tttgattcac aaggcaatat gcaacttaat 294 60 

gtagcaggag gactaaggat tgattctcaa aacagacgcc ttatacttga tgttagttat 29520 

ccgtttgatg ctcaaaacca actaaatcta agactaggac agggccctct ttttataaac 29580 

tcagcccaca acttggatat taactacaac aaaggccttt acttgtttac agcttcaaac 29640 

aattccaaaa agcttgaggt taacctaagc actgccaagg ggttgatgtt tgacgctaca 29700 

gccatagcca ttaatgcagg agatgggctt gaatttggtt cacctaatgc accaaacaca 29760 

aatcccctca aaacaaaaat tggccatggc ctagaatttg attcaaacaa ggctatggtt 29820 

cctaaactag gaactggcct tagttttgac agcacaggtg ccattacagt aggaaacaaa 29880 

aataatgata agctaacttt gtggaccaca ccagctccat ctcctaactg tagactaaat 29940 

gcagagaaag atgctaaact cactttggtc ttaacaaaat gtggcagtca aatacttgct 30000 

acagtttcag ttttggctgt taaaggcagt ttggctccaa tatctggaac agttcaaagt 30060 
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gctcatctta ttataagatt tgacgaaaat ggagtgctac taaacaattc cttcctggac 30120 

ccagaatatt ggaactttag aaatggagat cttactgaag gcacagccta tacaaacgct 30180 

gttggattta tgcctaacct atcagcttat ccaaaatctc acggtaaaac tgccaaaagt 30240 

aacattgtca gtcaagttta cttaaacgga gacaaaacta aacctgtaac actaaccatt 30300 

acactaaacg gtacacagga aacaggagac acaactccaa gtgcatactc tatgtcattt 30360 

tcatgggact ggtctggcca caactacatt aatgaaatat ttgccacatc ctcttacact 30420 

ttttcataca ttgcccaaga ataaagaatc gtttgtgtta tgtttcaacg tgtttatttt 30480 

tcaattgcag aaaatttcaa gtcatttttc attcagtagt atagccccac caccacatag 30540 

cttatacaga tcaccgtacc ttaatcaaac tcacagaacc ctagtattca acctgccacc 30600 

tccctcccaa cacacagagt acacagtcct ttctccccgg ctggccttaa aaagcatcat 30660 

atcatgggta acagacatat tcttaggtgt tatattccac acggtttcct gtcgagccaa 30720 

acgctcatca gtgatattaa taaactcccc gggcagctca cttaagttca tgtcgctgtc 30780 

cagctgctga gccacaggct gctgtccaac ttgcggttgc ttaacgggcg gcgaaggaga 30840 

agtccacgcc tacatggggg tagagtcata atcgtgcatc aggatagggc ggtggtgctg 30900 

cagcagcgcg cgaataaact gctgccgccg ccgctccgtc ctgcaggaat acaacatggc 30960 

agtggtctcc tcagcgatga ttcgcaccgc ccgcagcata aggcgccttg tcctccgggc 31020 

acagcagcgc accctgatct cacttaaatc agcacagtaa ctgcagcaca gcaccacaat 31080 

attgttcaaa atcccacagt gcaaggcgct gtatccaaag ctcatggcgg ggaccacaga 31140 

acccacgtgg ccatcatacc acaagcgcag gtagattaag tggcgacccc tcataaacac 31200 

gctggacata aacattacct cttttggcat gttgtaattc accacctccc ggtaccatat 31260 

aaacctctga ttaaacatgg cgccatccac caccatccta aaccagctgg ccaaaacctg 31320 

cccgccggct atacactgca gggaaccggg actggaacaa tgacagtgga gagcccagga 31380 

ctcgtaacca tggatcatca tgctcgtcat gatatcaatg ttggcacaac acaggcacac 31440 

gtgcatacac ttcctcagga ttacaagctc ctcccgcgtt agaaccatat cccagggaac 31500 

aacccattcc tgaatcagcg taaatcccac actgcaggga agacctcgca cgtaactcac 31560 

gttgtgcatt gtcaaagtgt tacattcggg cagcagcgga tgatcctcca gtatggtagc 31620 

gcgggtttct gtctcaaaag gaggtagacg atccctactg tacggagtgc gccgagacaa 31680 

ccgagatcgt gttggtcgta gtgtcatgcc aaatggaacg ccggacgtag tcatatttcc 31740 

tgaagcaaaa ccaggtgcgg gcgtgacaaa cagatctgcg tctccggtct cgccgcttag 31800 

atcgctctgt gtagtagttg tagtatatcc actctctcaa agcatccagg cgccccctgg 31860 

cttcgggttc tatgtaaact ccttcatgcg ccgctgccct gataacatcc accaccgcag 31920 

aataagccac acccagccaa cctacacatt cgttctgcga gtcacacacg ggaggagcgg 31980 

gaagagctgg aagaaccatg tttttttttt tattccaaaa gattatccaa aacctcaaaa 32040 

tgaagatcta ttaagtgaac gcgctcccct ccggtggcgt ggtcaaactc tacagccaaa 32100 

gaacagataa tggcatttgt aagatgttgc acaatggctt ccaaaaggca aacggccctc 32160 

acgtccaagt ggacgtaaag gctaaaccct tcagggtgaa tctcctctat aaacattcca 32220 

gcaccttcaa ccatgcccaa ataattctca tctcgccacc ttctcaatat atctctaagc 32280 

aaatcccgaa tattaagtcc ggccattgta aaaatctgct ccagagcgcc ctccaccttc 32340 

agcctcaagc agcgaatcat gattgcaaaa attcaggttc ctcacagacc tgtataagat 32400 

tcaaaagcgg aacattaaca aaaataccgc gatcccgtag gtcccttcgc agggccagct 32460 

gaacataatc gtgcaggtct gcacggacca gcgcggccac ttccccgcca ggaaccttga 32520 

caaaagaacc cacactgatt atgacacgca tactcggagc tatgctaacc agcgtagccc 32580 

cgatgtaagc tttgttgcat gggcggcgat ataaaatgca aggtgctgct caaaaaatca 32640 

ggcaaagcct cgcgcaaaaa agaaagcaca tcgtagtcat gctcatgcag ataaaggcag 32700 

gtaagctccg gaaccaccac agaaaaagac accatttttc tctcaaacat gtctgcgggt 32760 

ttctgcataa acacaaaata aaataacaaa aaaacattta aacattagaa gcctgtctta 32820 

caacaggaaa aacaaccctt ataagcataa gacggactac ggccatgccg gcgtgaccgt 32880 

aaaaaaactg gtcaccgtga ttaaaaagca ccaccgacag ctcctcggtc atgtccggag 32940 

tcataatgta agactcggta aacacatcag gttgattcat cggtcagtgc taaaaagcga 33000 

ccgaaatagc ccgggggaat acatacccgc aggcgtagag acaacattac agcccccata 33060 

ggaggtataa caaaattaat aggagagaaa aacacataaa cacctgaaaa accctcctgc 33120 

ctaggcaaaa tagcaccctc ccgctccaga acaacataca gcgcttcaca gcggcagcct 33180 

aacagtcagc cttaccagta aaaaagaaaa cctattaaaa aaacaccact cgacacggca 33240 

ccagctcaat cagtcacagt gtaaaaaagg gccaagtgca gagcgagtat atataggact 33300 

aaaaaatgac gtaacggtta aagtccacaa aaaacaccca gaaaaccgca cgcgaaccta 33360 

cgcccagaaa cgaaagccaa aaaacccaca acttcctcaa atcgtcactt ccgttttccc 33420 

acgttacgta acttcccatt ttaagaaaac tacaattccc aacacataca agttactccg 33480 

ccctaaaacc tacgtcaccc gccccgttcc cacgccccgc gccacgtcac aaactccacc 33540 

ccctcattat catattggct tcaatccaaa ataaggtata ttattgatga tg 33592 

<210> 2 
<211> 34341 
<212> DNA 

<213> Adenovirus subgroup C 
<400> 2 
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catcatcaat aatatacctt attttggatt 
ttgtgacgtg gcgcggggcg tgggaacggg 
gatgttgcaa gtgtggcgga acacatgtaa 
gtgtgcgccg gtgtacacag gaagtgacaa 
taaatttggg cgtaaccgag taagatttgg 
agtgaaatct gaataatttt gtgttactca 
gactttgacc gtttacgtgg agactcgccc 
cgggtcaaag ttggcgtttt attattatag 
tgagttcctc aagaggccac tcttgagtgc 
tccgacaccg ggactgaaaa tgagacatga 
ccattttgaa ccacctaccc ttcacgaact 
tcccaacgag gaggcggttt cgcagatttt 
agggattgac ttactcactt ttccgccggc 
ccggcagccc gagcagccgg agcagagagc 
tccacccagt gacgacgagg atgaagaggg 
ccccgggcac ggttgcaggt cttgtcatta 
tatgtgttcg ctttgctata tgaggacctg 
atgggcagtg ggtgatagag tggtgggttt 
gttttgtggt ttaaagaatt ttgtattgtg 
gagcctgagc ccgagccaga accggagcct 
cctgctatcc tgagacgccc gacatcacct 
agctgtgact ccggtccttc taacacacct 
cccattaaac cagttgccgt gagagttggt 
gacttgctta acgagcctgg gcaacctttg 
ggtgtaaacc tgtgattgcg tgtgtggtta 
agtttaataa agggtgagat aatgtttaac 
aaagggtata taatgcgccg tgggctaatc 
gagtgtttgg aagatttttc tgctgtgcgt 
tcttggtttt ggaggtttct gtggggctca 
gaggattaca agtgggaatt tgaagagctt 
ttgaatctgg gtcaccaggc gcttttccaa 
acaccggggc gcgctgcggc tgctgttgct 
gaagaaaccc atctgagcgg ggggtacctg 
gcggttgtga gacacaagaa tcgcctgcta 
ccgacggagg agcagcagca gcagcaggag 
ccatggaacc cgagagccgg cctggaccct 
tgtatccaga actgagacgc attttgacaa 
taaagaggga gcggggggct tgtgaggcta 
taatgaccag acaccgtcct gagtgtatta 
atgagcttga tctgctggcg cagaagtatt 
agccagggga tgattttgag gaggctatta 
attgcaagta caagatcagc aaacttgtaa 
acggggccga ggtggagata gatacggagg 
atatgtggcc gggggtgctt ggcatggacg 
gccccaattt tagcggtacg gttttcctgg 
gcttctatgg gtttaacaat acctgtgtgg 
gtgcctttta ctgctgctgg aagggggtgg 
agaaatgcct ctttgaaagg tgtaccttgg 
gccacaatgt ggcctccgac tgtggttgct 
agcataacat ggtatgtggc aactgcgagg 
acggcaactg tcacctgctg aagaccattc 
cagtgtttga gcataacata ctgacccgct 
tgttcctacc ttaccaatgc aatttgagtc 
tgtccaaggt gaacctgaac ggggtgtttg 
ggtacgatga gacccgcacc aggtgcagac 
accagcctgt gatgctggat gtgaccgagg 
gcacccgcgc tgagtttggc tctagcgatg 
ggcgtggctt aagggtggga aagaatatat 
gttttgcagc agccgccgcc gccatgagca 
catatttgac aacgcgcatg cccccatggg 
gcattgatgg tcgccccgtc ctgcccgcaa 
ctggaacgcc gttggagact gcagcctccg 
gcgggattgt gactgacttt gctttcctga 
catccgcccg cgatgacaag ttgacggctc 
aacttaatgt cgtttctcag cagctgttgg 
cttcctcccc tcccaatgcg gtttaaaaca 



gaagccaata tgataatgag ggggtggagt 60 
gcgggtgacg tagtagtgtg gcggaagtgt 120 
gcgacggatg tggcaaaagt gacgtttttg 180 
ttttcgcgcg gttttaggcg gatgttgtag 24 0 
ccattttcgc gggaaaactg aataagagga 300 
tagcgcgtaa tatttgtcta gggccgcggg 360 
aggtgttttt ctcaggtgtt ttccgcgttc 420 
tcagctgacg tgtagtgtat ttatacccgg 480 
cagcgagtag agttttctcc tccgagccgc 540 
ggtactggct gataatcttc cacctcctag 600 
gtatgattta gacgtgacgg cccccgaaga 660 
tcccgactct gtaatgttgg cggtgcagga 720 
gcccggttct ccggagccgc ctcacctttc 780 
cttgggtccg gtttgccacg aggctggctt 840 
tgaggagttt gtgttagatt atgtggagca 900 
tcaccggagg aatacggggg acccagatat 960 
tggcatgttt gtctacagta agtgaaaatt 1020 
ggtgtggtaa tttttttttt aatttttaca 1080 
atttttttaa aaggtcctgt gtctgaacct 1140 
gcaagaccta cccgccgtcc taaaatggcg 1200 
gtgtctagag aatgcaatag tagtacggat 1260 
cctgagatac acccggtggt cccgctgtgc 1320 
gggcgtcgcc aggctgtgga atgtatcgag 1380 
gacttgagct gtaaacgccc caggccataa 1440 
acgcctttgt ttgctgaatg agttgatgta 1500 
ttgcatggcg tgttaaatgg ggcggggctt 1560 
ttggttacat ctgacctcat ggaggcttgg 1620 
aacttgctgg aacagagctc taacagtacc 1680 
tcccaggcaa agttagtctg cagaattaag 1740 
ttgaaatcct gtggtgagct gtttgattct 1800 
gagaaggtca tcaagacttt ggatttttcc 1860 
tttttgagtt ttataaagga taaatggagc 1920 
ctggattttc tggccatgca tctgtggaga 1980 
ctgttgtctt ccgtccgccc ggcgataata 2040 
gaagccaggc ggcggcggca ggagcagagc 2100 
cgggaatgaa tgttgtacag gtggctgaac 2160 
ttacagagga tgggcagggg ctaaaggggg 2220 
cagaggaggc taggaatcta gcttttagct 2280 
cttttcaaca gatcaaggat aattgcgcta 2340 
ccatagagca gctgaccact tactggctgc 2400 
gggtatatgc aaaggtggca cttaggccag 24 60 
atatcaggaa ttgttgctac atttctggga 2520 
atagggtggc ctttagatgt agcatgataa 2580 
gggtggttat tatgaatgta aggtttactg 2640 
ccaataccaa ccttatccta cacggtgtaa 2700 
aagcctggac cgatgtaagg gttcggggct 2760 
tgtgtcgccc caaaagcagg gcttcaatta 2820 
gtatcctgtc tgagggtaac tccagggtgc 2880 
tcatgctagt gaaaagcgtg gctgtgatta 2940 
acagggcctc tcagatgctg acctgctcgg 3000 
acgtagccag ccactctcgc aaggcctggc 3060 
gttccttgca tttgggtaac aggagggggg 3120 
acactaagat attgcttgag cccgagagca 3180 
acatgaccat gaagatctgg aaggtgctga 3240 
cctgcgagtg tggcggtaaa catattagga 3300 
agctgaggcc cgatcacttg gtgctggcct 3360 
aagatacaga ttgaggtact gaaatgtgtg 3420 
aaggtggggg tcttatgtag ttttgtatct 3480 
ccaactcgtt tgatggaagc attgtgagct 3540 
ccggggtgcg tcagaatgtg atgggctcca 3600 
actctactac cttgacctac gagaccgtgt 3660 
ccgccgcttc agccgctgca gccaccgccc 3720 
gcccgcttgc aagcagtgca gcttcccgtt 3780 
ttttggcaca attggattct ttgacccggg 3840 
atctgcgcca gcaggtttct gccctgaagg 3900 
taaataaaaa accagactct gtttggattt 3960 
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ggatcaagca agtgtcttgc tgtctttatt taggggtttt gcgcgcgcgg taggcccggg 4020 
accagcggtc tcggtcgttg agggtcctgt gtattttttc caggacgtgg taaaggtgac 4080 
tctggatgtt cagatacatg ggcataagcc cgtctctggg gtggaggtag caccactgca 4140 
gagcttcatg ctgcggggtg gtgttgtaga tgatccagtc gtagcaggag cgctgggcgt 4200 
ggtgcctaaa aatgtctttc agtagcaagc tgattgccag gggcaggccc ttggtgtaag 4260 
tgtttacaaa gcggttaagc tgggatgggt gcatacgtgg ggatatgaga tgcatcttgg 4320 
actgtatttt taggttggct atgttcccag ccatatccct ccggggattc atgttgtgca 4380 
gaaccaccag cacagtgtat ccggtgcact tgggaaattt gtcatgtagc ttagaaggaa 44 40 
atgcgtggaa gaacttggag acgcccttgt gacctccaag attttccatg cattcgtcca 4500 
taatgatggc aatgggccca cgggcggcgg cctgggcgaa gatatttctg ggatcactaa 4560 
cgtcatagtt gtgttccagg atgagatcgt cataggccat ttttacaaag cgcgggcgga 4620 
gggtgccaga ctgcggtata atggttccat ccggcccagg ggcgtagtta ccctcacaga 4680 
tttgcatttc ccacgctttg agttcagatg gggggatcat gtctacctgc ggggcgatga 474 0 
agaaaacggt ttccggggta ggggagatca gctgggaaga aagcaggttc ctgagcagct 4800 
gcgacttacc gcagccggtg ggcccgtaaa tcacacctat taccgggtgc aactggtagt 4860 
taagagagct gcagctgccg tcatccctga gcaggggggc cacttcgtta agcatgtccc 4 920 
tgactcgcat gttttccctg accaaatccg ccagaaggcg ctcgccgccc agcgatagca 4 980 
gttcttgcaa ggaagcaaag tttttcaacg gtttgagacc gtccgccgta ggcatgcttt 5040 
tgagcgtttg accaagcagt tccaggcggt cccacagctc ggtcacctgc tctacggcat 5100 
ctcgatccag catatctcct cgtttcgcgg gttggggcgg ctttcgctgt acggcagtag 5160 
tcggtgctcg tccagacggg ccagggtcat gtctttccac gggcgcaggg tcctcgtcag 5220 
cgtagtctgg gtcacggtga aggggtgcgc tccgggctgc gcgctggcca gggtgcgctt 5280 
gaggctggtc ctgctggtgc tgaagcgctg ccggtcttcg ccctgcgcgt cggccaggta 534 0 
gcatttgacc atggtgtcat agtccagccc ctccgcggcg tggcccttgg cgcgcagctt 54 00 
gcccttggag gaggcgccgc acgaggggca gtgcagactt ttgagggcgt agagcttggg 54 60 
cgcgagaaat accgattccg gggagtaggc atccgcgccg caggccccgc agacggtctc 5520 
gcattccacg agccaggtga gctctggccg ttcggggtca aaaaccaggt ttcccccatg 5580 
ctttttgatg cgtttcttac ctctggtttc catgagccgg tgtccacgct cggtgacgaa 5640 
aaggctgtcc gtgtccccgt atacagactt gagaggcctg tcctcgagcg gtgttccgcg 5700 
gtcctcctcg tatagaaact cggaccactc tgagacaaag gctcgcgtcc aggccagcac 5760 
gaaggaggct aagtgggagg ggtagcggtc gttgtccact agggggtcca ctcgctccag 5820 
ggtgtgaaga cacatgtcgc cctcttcggc atcaaggaag gtgattggtt tgtaggtgta 5880 
ggccacgtga ccgggtgttc ctgaaggggg gctataaaag ggggtggggg cgcgttcgtc 5940 
ctcactctct tccgcatcgc tgtctgcgag ggccagctgt tggggtgagt actccctctg 6000 
aaaagcgggc atgacttctg cgctaagatt gtcagtttcc aaaaacgagg aggatttgat 6060. 
attcacctgg cccgcggtga tgcctttgag ggtggccgca tccatctggt cagaaaagac 6120 
aatctttttg ttgtcaagct tggtggcaaa cgacccgtag agggcgttgg acagcaactt 6180 
ggcgatggag cgcagggttt ggtttttgtc gcgatcggcg cgctccttgg ccgcgatgtt 6240 
tagctgcacg tattcgcgcg caacgcaccg ccattcggga aagacggtgg tgcgctcgtc 6300 
gggcaccagg tgcacgcgcc aaccgcggtt gtgcagggtg acaaggtcaa cgctggtggc 6360 
tacctctccg cgtaggcgct cgttggtcca gcagaggcgg ccgcccttgc gcgagcagaa 6420 
tggcggtagg gggtctagct gcgtctcgtc cggggggtct gcgtccacgg taaagacccc 6480 
gggcagcagg cgcgcgtcga agtagtctat cttgcatcct tgcaagtcta gcgcctgctg 6540 
ccatgcgcgg gcggcaagcg cgcgctcgta tgggttgagt gggggacccc atggcatggg 6600 
gtgggtgagc gcggaggcgt acatgccgca aatgtcgtaa acgtagaggg gctctctgag 6660 
tattccaaga tatgtagggt agcatcttcc accgcggatg ctggcgcgca cgtaatcgta 6720 
tagttcgtgc gagggagcga ggaggtcggg accgaggttg ctacgggcgg gctgctctgc 6780 
tcggaagact atctgcctga agatggcatg tgagttggat gatatggttg gacgctggaa 6840 
gacgttgaag ctggcgtctg tgagacctac cgcgtcacgc acgaaggagg cgtaggagtc 6900 
gcgcagcttg ttgaccagct cggcggtgac ctgcacgtct agggcgcagt agtccagggt 6960 
ttccttgatg atgtcatact tatcctgtcc cttttttttc cacagctcgc ggttgaggac 7020 
aaactcttcg cggtctttcc agtactcttg gatcggaaac ccgtcggcct ccgaacggta 7080 
agagcctagc atgtagaact ggttgacggc ctggtaggcg cagcatccct tttctacggg 7140 
tagcgcgtat gcctgcgcgg ccttccggag cgaggtgtgg gtgagcgcaa aggtgtccct 7200 
gaccatgact ttgaggtact ggtatttgaa gtcagtgtcg tcgcatccgc cctgctccca 7260 
gagcaaaaag tccgtgcgct ttttggaacg cggatttggc agggcgaagg tgacatcgtt 7320 
gaagagtatc tttcccgcgc gaggcataaa gttgcgtgtg atgcggaagg gtcccggcac 7380 
ctcggaacgg ttgttaatta cctgggcggc gagcacgatc tcgtcaaagc cgttgatgtt 74 4 0 
gtggcccaca atgtaaagtt ccaagaagcg cgggatgccc ttgatggaag gcaatttttt 7500 
aagttcctcg taggtgagct cttcagggga gctgagcccg tgctctgaaa gggcccagtc 7560 
tgcaagatga gggttggaag cgacgaatga gctccacagg tcacgggcca ttagcatttg 7620 
caggtggtcg cgaaaggtcc taaactggcg acctatggcc attttttctg gggtgatgca 7680 
gtagaaggta agcgggtctt gttcccagcg gtcccatcca aggttcgcgg ctaggtctcg 7740 
cgcggcagtc actagaggct catctccgcc gaacttcatg accagcatga agggcacgag 7800 
ctgcttccca aaggccccca tccaagtata ggtctctaca tcgtaggtga caaagagacg 7860 
ctcggtgcga ggatgcgagc cgatcgggaa gaactggatc tcccgccacc aattggagga 7920 



WO01/042S2 



12 



PCTAJS00/18971 



gtggctattg atgtggtgaa agtagaagtc cctgcgacgg gccgaacact cgtgctggct 7980 
tttgtaaaaa cgtgcgcagt actggcagcg gtgcacgggc tgtacatcct gcacgaggtt 8040 
gacctgacga ccgcgcacaa ggaagcagag tgggaatttg agcccctcgc ctggcgggtt 8100 
tggctggtgg tcttctactt cggctgcttg tccttgaccg tctggctgct cgaggggagt 8160 
tacggtggat cggaccacca cgccgcgcga gcccaaagtc cagatgtccg cgcgcggcgg 8220 
tcggagcttg atgacaacat cgcgcagatg ggagctgtcc atggtctgga gctcccgcgg 8280 
cgtcaggtca ggcgggagct cctgcaggtt tacctcgcat agacgggtca gggcgcgggc 8340 
tagatccagg tgatacctaa tttccagggg ctggttggtg gcggcgtcga tggcttgcaa 8400 
gaggccgcat ccccgcggcg cgactacggt accgcgcggc gggcggtggg ccgcgggggt 84 60 
gtccttggat gatgcatcta aaagcggtga cgcgggcgag cccccggagg tagggggggc 8520 
tccggacccg ccgggagagg gggcaggggc acgtcggcgc cgcgcgcggg caggagctgg 8580 
tgctgcgcgc gtaggttgct ggcgaacgcg acgacgcggc ggttgatctc ctgaatctgg 8640 
cgcctctgcg tgaagacgac gggcccggtg agcttgagcc tgaaagagag ttcgacagaa 8700 
tcaatttcgg tgtcgttgac ggcggcctgg cgcaaaatct cctgcacgtc tcctgagttg 8760 
tcttgatagg cgatctcggc catgaactgc tcgatctctt cctcctggag atctccgcgt 8820 
ccggctcgct ccacggtggc ggcgaggtcg ttggaaatgc gggccatgag ctgcgagaag 8880 
gcgttgaggc ctccctcgtt ccagacgcgg ctgtagacca cgcccccttc ggcatcgcgg 8940 
gcgcgcatga ccacctgcgc gagattgagc tccacgtgcc gggcgaagac ggcgtagttt 9000 
cgcaggcgct gaaagaggta gttgagggtg gtggcggtgt gttctgccac gaagaagtac 9060 
ataacccagc gtcgcaacgt ggattcgttg atatccccca aggcctcaag gcgctccatg 9120 
gcctcgtaga agtccacggc gaagttgaaa aactgggagt tgcgcgccga cacggttaac 9180 
tcctcctcca gaagacggat gagctcggcg acagtgtcgc gcacctcgcg ctcaaaggct 9240 
acaggggcct cttcttcttc ttcaatctcc tcttccataa gggcctcccc ttcttcttct 9300 
tctggcggcg gtgggggagg ggggacacgg cggcgacgac ggcgcaccgg gaggcggtcg 9360 
acaaagcgct cgatcatctc cccgcggcga cggcgcatgg tctcggtgac ggcgcggccg 9420 
ttctcgcggg ggcgcagttg gaagacgccg cccgtcatgt cccggttatg ggttggcggg 9480 
gggctgccat gcggcaggga tacggcgcta acgatgcatc tcaacaattg ttgtgtaggt 9540 
actccgccgc cgagggacct gagcgagtcc gcatcgaccg gatcggaaaa cctctcgaga 9600 
aaggcgtcta accagtcaca gtcgcaaggt aggctgagca ccgtggcggg cggcagcggg 9660 
cggcggtcgg ggttgtttct ggcggaggtg ctgctgatga tgtaattaaa gtaggcggtc 9720 
ttgagacggc ggatggtcga cagaagcacc atgtccttgg gtccggcctg ctgaatgcgc 9780 
aggcggtcgg ccatgcccca ggcttcgttt tgacatcggc gcaggtcttt gtagtagtct 9840 
tgcatgagcc tttctaccgg cacttcttct tctccttcct cttgtcctgc atctcttgca 9900 
tctatcgctg cggcggcggc ggagtttggc cgtaggtggc gccctcttcc tcccatgcgt 9960 
gtgaccccga agcccctcat cggctgaagc agggctaggt cggcgacaac gcgctcggct 10020 
aatatggcct gctgcacctg cgtgagggta gactggaagt catccatgtc cacaaagcgg 10080 
tggtatgcgc ccgtgttgat ggtgtaagtg cagttggcca taacggacca gttaacggtc 10140 
tggtgacccg gctgcgagag ctcggtgtac ctgagacgcg agtaagccct cgagtcaaat 10200 
acgtagtcgt tgcaagtccg caccaggtac tggtatccca ccaaaaagtg cggcggcggc 10260 
tggcggtaga ggggccagcg tagggtggcc ggggctccgg gggcgagatc ttccaacata 10320 
aggcgatgat atccgtagat gtacctggac atccaggtga tgccggcggc ggtggtggag 10380 
gcgcgcggaa agtcgcggac gcggttccag atgttgcgca gcggcaaaaa gtgctccatg 10440 
gtcgggacgc tctggccggt caggcgcgcg caatcgttga cgctctagcg tgcaaaagga 10500 
gagcctgtaa gcgggcactc ttccgtggtc tggtggataa attcgcaagg gtatcatggc 10560 
ggacgaccgg ggttcgagcc ccgtatccgg ccgtccgccg tgatccatgc ggttaccgcc 10620 
cgcgtgtcga acccaggtgt gcgacgtcag acaacggggg agtgctcctt ttggcttcct 10680 
tccaggcgcg gcggctgctg cgctagcttt tttggccact ggccgcgcgc agcgtaagcg 10740 
gttaggctgg aaagcgaaag cattaagtgg ctcgctccct gtagccggag ggttattttc 10800 
caagggttga gtcgcgggac ccccggttcg agtctcggac cggccggact gcggcgaacg 10860 
ggggtttgcc tccccgtcat gcaagacccc gcttgcaaat tcctccggaa acagggacga 10920 
gccccttttt tgcttttccc agatgcatcc ggtgctgcgg cagatgcgcc cccctcctca 10980 
gcagcggcaa gagcaagagc agcggcagac atgcagggca ccctcccctc ctcctaccgc 11040 
gtcaggaggg gcgacatccg cggttgacgc ggcagcagat ggtgattacg aacccccgcg 11100 
gcgccgggcc cggcactacc tggacttgga ggagggcgag ggcctggcgc ggctaggagc 11160 
gccctctcct gagcggtacc caagggtgca gctgaagcgt gatacgcgtg aggcgtacgt 11220 
gccgcggcag aacctgtttc gcgaccgcga gggagaggag cccgaggaga tgcgggatcg 11280 
aaagttccac gcagggcgcg agctgcggca tggcctgaat cgcgagcggt tgctgcgcga 11340 
ggaggacttt gagcccgacg cgcgaaccgg gattagtccc gcgcgcgcac acgtggcggc 11400 
cgccgacctg gtaaccgcat acgagcagac ggtgaaccag gagattaact ttcaaaaaag 11460 
ctttaacaac cacgtgcgta cgcttgtggc gcgcgaggag gtggctatag gactgatgca 11520 
tctgtgggac tttgtaagcg cgctggagca aaacccaaat agcaagccgc tcatggcgca 11580 
gctgttcctt atagtgcagc acagcaggga caacgaggca ttcagggatg cgctgctaaa 11640 
catagtagag cccgagggcc gctggctgct cgatttgata aacatcctgc agagcatagt 11700 
ggtgcaggag cgcagcttga gcctggctga caaggtggcc gccatcaact attccatgct 11760 
tagcctgggc aagttttacg cccgcaagat ataccatacc ccttacgttc ccatagacaa 11820 
ggaggtaaag atcgaggggt tctacatgcg catggcgctg aaggtgctta ccttgagcga 11880 
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cgacctgggc gtttatcgca acgagcgcat ccacaaggcc gtgagcgtga gccggcggcg 11940 
cgagctcagc gaccgcgagc tgatgcacag cctgcaaagg gccctggctg gcacgggcag 12000 
cggcgataga gaggccgagt cctactttga cgcgggcgct gacctgcgct gggccccaag 12060 
ccgacgcgcc ctggaggcag ctggggccgg acctgggctg gcggtggcac ccgcgcgcgc 12120 
tggcaacgtc ggcggcgtgg aggaatatga cgaggacgat gagtacgagc cagaggacgg 12180 
cgagtactaa gcggtgatgt ttctgatcag atgatgcaag acgcaacgga cccggcggtg 12240 
cgggcggcgc tgcagagcca gccgtccggc cttaactcca cggacgactg gcgccaggtc 12300 
atggaccgca tcatgtcgct gactgcgcgc aatcctgacg cgttccggca gcagccgcag 12360 
gccaaccggc tctccgcaat tctggaagcg gtggtcccgg cgcgcgcaaa ccccacgcac 12420 
gagaaggtgc tggcgatcgt aaacgcgctg gccgaaaaca gggccatccg gcccgacgag 124 BO 
gccggcctgg tctacgacgc gctgcttcag cgcgtggctc gttacaacag cggcaacgtg 12540 
cagaccaacc tggaccggct ggtgggggat gtgcgcgagg ccgtggcgca gcgtgagcgc 12600 
gcgcagcagc agggcaacct gggctccatg gttgcactaa acgccttcct gagtacacag 12660 
cccgccaacg tgccgcgggg acaggaggac tacaccaact ttgtgagcgc actgcggcta 12720 
atggtgactg agacaccgca aagtgaggtg taccagtctg ggccagacta ttttttccag 12780 
accagtagac aaggcctgca gaccgtaaac ctgagccagg ctttcaaaaa cttgcagggg 12840 
ctgtgggggg tgcgggctcc cacaggcgac cgcgcgaccg tgtctagctt gctgacgccc 12900 
aactcgcgcc tgttgctgct gctaatagcg cccttcacgg acagtggcag cgtgtcccgg 12960 
gacacatacc taggtcactt gctgacactg taccgcgagg ccataggtca ggcgcatgtg 13020 
gacgagcata ctttccagga gattacaagt gtcagccgcg cgctggggca ggaggacacg 13080 
ggcagcctgg aggcaaccct aaactacctg ctgaccaacc ggcggcagaa gatcccctcg 13140 
ttgcacagtt taaacagcga ggaggagcgc attttgcgct acgtgcagca gagcgtgagc 13200 
cttaacctga tgcgcgacgg ggtaacgccc agcgtggcgc tggacatgac cgcgcgcaac 13260 
atggaaccgg gcatgtatgc ctcaaaccgg ccgtttatca accgcctaat ggactacttg 13320 
catcgcgcgg ccgccgtgaa ccccgagtat ttcaccaatg ccatcttgaa cccgcactgg 13380 
ctaccgcccc ctggtttcta caccggggga ttcgaggtgc ccgagggtaa cgatggattc 13440 
ctctgggacg acatagacga cagcgtgttt tccccgcaac cgcagaccct gctagagttg 13500 
caacagcgcg agcaggcaga ggcggcgctg cgaaaggaaa gcttccgcag gccaagcagc 13560 
ttgtccgatc taggcgctgc ggccccgcgg tcagatgcta gtagcccatt tccaagcttg 13620 
atagggtctc ttaccagcac tcgcaccacc cgcccgcgcc tgctgggcga ggaggagtac 13680 
ctaaacaact cgctgctgca gccgcagcgc gaaaaaaacc tgcctccggc atttcccaac 13740 
aacgggatag agagcctagt ggacaagatg agtagatgga agacgtacgc gcaggagcac 13800 
agggacgtgc caggcccgcg cccgcccacc cgtcgtcaaa ggcacgaccg tcagcggggt 13860 
ctggtgtggg aggacgatga ctcggcagac gacagcagcg tcctggattt gggagggagt 13920 
ggcaacccgt ttgcgcacct tcgccccagg ctggggagaa tgttttaaaa aaaaaaaagc 13980 
atgatgcaaa ataaaaaact caccaaggcc atggcaccga gcgttggttt tcttgtattc 14040 
cccttagtat gcggcgcgcg gcgatgtatg aggaaggtcc tcctccctcc tacgagagtg 14100 
tggtgagcgc ggcgccagtg gcggcggcgc tgggttctcc cttcgatgct cccctggacc 14160 
cgccgtttgt gcctccgcgg tacctgcggc ctaccggggg gagaaacagc atccgttact 14220 
ctgagttggc acccctattc gacaccaccc gtgtgtacct ggtggacaac aagtcaacgg 14280 
atgtggcatc cctgaactac cagaacgacc acagcaactt tctgaccacg gtcattcaaa 14340 
acaatgacta cagcccgggg gaggcaagca cacagaccat caatcttgac gaccggtcgc 14400 
actggggcgg cgacctgaaa accatcctgc ataccaacat gccaaatgtg aacgagttca 14460 
tgtttaccaa taagtttaag gcgcgggtga tggtgtcgcg cttgcctact aaggacaatc 14520 
aggtggagct gaaatacgag tgggtggagt tcacgctgcc cgagggcaac tactccgaga 14580 
ccatgaccat agaccttatg aacaacgcga tcgtggagca ctacttgaaa gtgggcagac 14640 
agaacggggt tctggaaagc gacatcgggg taaagtttga cacccgcaac ttcagactgg 14700 
ggtttgaccc cgtcactggt cttgtcatgc ctggggtata tacaaacgaa gccttccatc 14760 
cagacatcat tttgctgcca ggatgcgggg tggacttcac ccacagccgc ctgagcaact 14820 
tgttgggcat ccgcaagcgg caacccttcc aggagggctt taggatcacc tacgatgatc 14880 
tggagggtgg taacattccc gcactgttgg atgtggacgc ctaccaggcg agcttgaaag 14 940 
atgacaccga acagggcggg ggtggcgcag gcggcagcaa cagcagtggc agcggcgcgg 15000 
aagagaactc caacgcggca gccgcggcaa tgcagccggt ggaggacatg aacgatcatg 15060 
ccattcgcgg cgacaccttt gccacacggg ctgaggagaa gcgcgctgag gccgaagcag 15120 
cggccgaagc tgccgccccc gctgcgcaac ccgaggtcga gaagcctcag aagaaaccgg 15180 
tgatcaaacc cctgacagag gacagcaaga aacgcagtta caacctaata agcaatgaca 15240 
gcaccttcac ccagtaccgc agctggtacc ttgcatacaa ctacggcgac cctcagaccg 15300 
gaatccgctc atggaccctg ctttgcactc ctgacgtaac ctgcggctcg gagcaggtct 15360 
actggtcgtt gccagacatg atgcaagacc ccgtgacctt ccgctccacg cgccagatca 15420 
gcaactttcc ggtggtgggc gccgagctgt tgcccgtgca ctccaagagc ttctacaacg 15480 
accaggccgt ctactcccaa ctcatccgcc agtttacctc tctgacccac gtgttcaatc 15540 
gctttcccga gaaccagatt ttggcgcgcc cgccagcccc caccatcacc accgtcagtg 15600 
aaaacgttcc tgctctcaca gatcacggga cgctaccgct gcgcaacagc atcggaggag 15660 
tccagcgagt gaccattact gacgccagac gccgcacctg cccctacgtt tacaaggccc 15720 
tgggcatagt ctcgccgcgc gtcctatcga gccgcacttt ttgagcaagc atgtccatcc 15780 
ttatatcgcc cagcaataac acaggctggg gcctgcgctt cccaagcaag atgtttggcg 15840 



WO 01/04282 



14 



PCT/US00/18971 



gggccaagaa gcgctccgac caacacccag tgcgcgtgcg cgggcactac cgcgcgccct 15900 
ggggcgcgca caaacgcggc cgcactgggc gcaccaccgt cgatgacgcc atcgacgcgg 15960 
tggtggagga ggcgcgcaac tacacgccca cgccgccacc agtgtccaca gtggacgcgg 16020 
ccattcagac cgtggtgcgc ggagcccggc gctatgctaa aatgaagaga cggcggaggc 16080 
gcgtagcacg tcgccaccgc cgccgacccg gcactgccgc ccaacgcgcg gcggcggccc 16140 
tgcttaaccg cgcacgtcgc accggccgac gggcggccat gcgggccgct cgaaggctgg 16200 
ccgcgggtat tgtcactgtg ccccccaggt ccaggcgacg agcggccgcc gcagcagccg 16260 
cggccattag tgctatgact cagggtcgca ggggcaacgt gtattgggtg cgcgactcgg 16320 
ttagcggcct gcgcgtgccc gtgcgcaccc gccccccgcg caactagatt gcaagaaaaa 16380 
actacttaga ctcgtactgt tgtatgtatc cagcggcggc ggcgcgcaac gaagctatgt 164 40 
ccaagcgcaa aatcaaagaa gagatgctcc aggtcatcgc gccggagatc tatggccccc 16500 
cgaagaagga agagcaggat tacaagcccc gaaagctaaa gcgggtcaaa aagaaaaaga 16560 
aagatgatga tgatgaactt gacgacgagg tggaactgct gcacgctacc gcgcccaggc 16620 
gacgggtaca gtggaaaggt cgacgcgtaa aacgtgtttt gcgacccggc accaccgtag 16680 
tctttacgcc cggtgagcgc tccacccgca cctacaagcg cgtgtatgat gaggtgtacg 16740 
gcgacgagga cctgcttgag caggccaacg agcgcctcgg ggagtttgcc tacggaaagc 16800 
ggcataagga catgctggcg ttgccgctgg acgagggcaa cccaacacct agcctaaagc 16860 
ccgtaacact gcagcaggtg ctgcccgcgc ttgcaccgtc cgaagaaaag cgcggcctaa 16920 
agcgcgagtc tggtgacttg gcacccaccg tgcagctgat ggtacccaag cgccagcgac 16980 
tggaagatgt cttggaaaaa . atgaccgtgg aacctgggct ggagcccgag gtccgcgtgc 17040 
ggccaatcaa gcaggtggcg ccgggactgg gcgtgcagac cgtggacgtt cagataccca 17100 
ctaccagtag caccagtatt gccaccgcca cagagggcat ggagacacaa acgtccccgg 17160 
ttgcctcagc ggtggcggat gccgcggtgc aggcggtcgc tgcggccgcg tccaagacct 17220 
ctacggaggt gcaaacggac ccgtggatgt ttcgcgtttc agccccccgg cgcccgcgcg 17280 
gttcgaggaa gtacggcgcc gccagcgcgc tactgcccga atatgcccta catccttcca 17340 
ttgcgcctac ccccggctat cgtggctaca cctaccgccc cagaagacga gcaactaccc 17400 
gacgccgaac caccactgga acccgccgcc gccgtcgccg tcgccagccc gtgctggccc 174 60 
cgatttccgt gcgcagggtg gctcgcgaag gaggcaggac cctggtgctg ccaacagcgc 17520 
gctaccaccc cagcatcgtt taaaagccgg tctttgtggt tcttgcagat atggccctca 17580 
cctgccgcct ccgtttcccg gtgccgggat tccgaggaag aatgcaccgt aggaggggca 17640 
tggccggcca cggcctgacg ggcggcatgc gtcgtgcgca ccaccggcgg cggcgcgcgt 17700 
cgcaccgtcg catgcgcggc ggtatcctgc ccctccttat tccactgatc gccgcggcga 17760 
ttggcgccgt gcccggaatt gcatccgtgg ccttgcaggc gcagagacac tgattaaaaa 17820 
caagttgcat gtggaaaaat caaaataaaa agtctggact ctcacgctcg cttggtcctg 17880 
taactatttt gtagaatgga agacatcaac tttgcgtctc tggccccgcg acacggctcg 17940 
cgcccgttca tgggaaactg gcaagatatc ggcaccagca atatgagcgg tggcgccttc 18000 
agctggggct cgctgtggag cggcattaaa aatttcggtt ccaccgttaa gaactatggc 18060 
agcaaggcct ggaacagcag cacaggccag atgctgaggg ataagttgaa agagcaaaat 18120 
ttccaacaaa aggtggtaga tggcctggcc tctggcatta gcggggtggt ggacctggcc 18180 
aaccaggcag tgcaaaataa gattaacagt aagcttgatc cccgccctcc cgtagaggag 18240 
cctccaccgg ccgtggagac agtgtctcca gaggggcgtg gcgaaaagcg tccgcgcccc 18300 
gacagggaag aaactctggt gacgcaaata gacgagcctc cctcgtacga ggaggcacta 18360 
aagcaaggcc tgcccaccac ccgtcccatc gcgcccatgg ctaccggagt gctgggccag 18420 
cacacacccg taacgctgga cctgcctccc cccgccgaca cccagcagaa acctgtgctg 18480 
ccaggcccga ccgccgttgt tgtaacccgt cctagccgcg cgtccctgcg ccgcgccgcc 18540 
agcggtccgc gatcgttgcg gcccgtagcc agtggcaact ggcaaagcac actgaacagc 18600 
atcgtgggtc tgggggtgca atccctgaag cgccgacgat gcttctgaat agctaacgtg 18660 
tcgtatgtgt gtcatgtatg cgtccatgtc gccgccagag gagctgctga gccgccgcgc 18720 
gcccgctttc caagatggct accccttcga tgatgccgca gtggtcttac atgcacatct 18780 
cgggccagga cgcctcggag tacctgagcc ccgggctggt gcagtttgcc cgcgccaccg 18840 
agacgtactt cagcctgaat aacaagttta gaaaccccac ggtggcgcct acgcacgacg 18900 
tgaccacaga ccggtcccag cgtttgacgc tgcggttcat ccctgtggac cgtgaggata 18960 
ctgcgtactc gtacaaggcg cggttcaccc tagctgtggg tgataaccgt gtgctggaca 19020 
tggcttccac gtactttgac atccgcggcg tgctggacag gggccctact tttaagccct 19080 
actctggcac tgcctacaac gccctggctc ccaagggtgc cccaaatcct tgcgaatggg 19140 
atgaagctgc tactgctctt gaaataaacc tagaagaaga ggacgatgac aacgaagacg 19200 
aagtagacga gcaagctgag cagcaaaaaa ctcacgtatt tgggcaggcg ccttattctg 19260 
gtataaatat tacaaaggag ggtattcaaa taggtgtcga aggtcaaaca cctaaatatg 19320 
ccgataaaac atttcaacct gaacctcaaa taggagaatc tcagtggtac gaaactgaaa 19380 
ttaatcatgc agctgggaga gtccttaaaa agactacccc aatgaaacca tgttacggtt 194 40 
catatgcaaa acccacaaat gaaaatggag ggcaaggcat tcttgtaaag caacaaaatg 19500 
gaaagctaga aagtcaagtg gaaatgcaat ttttctcaac tactgaggcg accgcaggca 19560 
atggtgataa cttgactcct aaagtggtat tgtacagtga agatgtagat atagaaaccc 19620 
cagacactca tatttcttac atgcccacta ttaaggaagg taactcacga gaactaatgg 19680 
gccaacaatc tatgcccaac aggcctaatt acattgcttt tagggacaat tttattggtc 19740 
taatgtatta caacagcacg ggtaatatgg gtgttctggc gggccaagca tcgcagttga 19800 
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atgctgttgt agatttgcaa gacagaaaca cagagctttc ataccagctt ttgcttgatt 19860 
ccattggtga tagaaccagg tacttttcta tgtggaatca ggctgttgac agctatgatc 19920 
cagatgttag aattattgaa aatcatggaa ctgaagatga acttccaaat tactgctttc 19980 
cactgggagg tgtgattaat acagagactc ttaccaaggt aaaacctaaa acaggtcagg 20040 
aaaatggatg ggaaaaagat gctacagaat tttcagataa aaatgaaata agagttggaa 20100 
ataattttgc catggaaatc aatctaaatg ccaacctgtg gagaaatttc ctgtactcca 20160 
acatagcgct gtatttgccc gacaagctaa agtacagtcc ttccaacgta aaaatttctg 20220 
ataacccaaa cacctacgac tacatgaaca agcgagtggt ggctcccggg ttagtggact 20280 
gctacattaa ccttggagca cgctggtccc ttgactatat ggacaacgtc aacccattta 20340 
accaccaccg caatgctggc ctgcgctacc gctcaatgtt gctgggcaat ggtcgctatg 204 00 
tgcccttcca catccaggtg cctcagaagt tctttgccat taaaaacctc cttctcctgc 204 60 
cgggctcata cacctacgag tggaacttca ggaaggatgt taacatggtt ctgcagagct 20520 
ccctaggaaa tgacctaagg gttgacggag ccagcattaa gtttgatagc atttgccttt 20580 
acgccacctt cttccccatg gcccacaaca ccgcctccac gcttgaggcc atgcttagaa 20640 
acgacaccaa cgaccagtcc tttaacgact atctctccgc cgccaacatg ctctacccta 20700 
tacccgccaa cgctaccaac gtgcccatat ccatcccctc ccgcaactgg gcggctttcc 20760 
gcggctgggc cttcacgcgc cttaagacta aggaaacccc atcactgggc tcgggctacg 20820 
acccttatta cacctactct ggctctatac cctacctaga tggaaccttt tacctcaacc 20880 
acacctttaa gaaggtggcc attacctttg actcttctgt cagctggcct ggcaatgacc 20940 
gcctgcttac ccccaacgag tttgaaatta agcgctcagt tgacggggag ggttacaacg 21000 
ttgcccagtg taacatgacc aaagactggt tcctggtaca aatgctagct aactacaaca 21060 
ttggctacca gggcttctat atcccagaga gctacaagga ccgcatgtac tccttcttta 21120 
gaaacttcca gcccatgagc cgtcaggtgg tggatgatac taaatacaag gactaccaac 21180 
aggtgggcat cctacaccaa cacaacaact ctggatttgt tggctacctt gcccccacca 21240 
tgcgcgaagg acaggcctac cctgctaact tcccctatcc gcttataggc aagaccgcag 21300 
ttgacagcat tacccagaaa aagtttcttt gcgatcgcac cctttggcgc atcccattct 21360 
ccagtaactt tatgtccatg ggcgcactca cagacctggg ccaaaacctt ctctacgcca 21420 
actccgccca cgcgctagac atgacttttg aggtggatcc catggacgag cccacccttc 21480 
tttatgtttt gtttgaagtc tttgacgtgg tccgtgtgca ccggccgcac cgcggcgtca 21540 
tcgaaaccgt gtacctgcgc acgcccttct cggccggcaa cgccacaaca taaagaagca 21600 
agcaacatca acaacagctg ccgccatggg ctccagtgag caggaactga aagccattgt 21660 
caaagatctt ggttgtgggc catatttttt gggcacctat gacaagcgct ttccaggctt 21720 
tgtttctcca cacaagctcg cctgcgccat agtcaatacg gccggtcgcg agactggggg 21780 
cgtacactgg atggcctttg cctggaaccc gcactcaaaa acatgctacc tctttgagcc 21840 
ctttggcttt tctgaccagc gactcaagca ggtttaccag tttgagtacg agtcactcct 21900 
gcgccgtagc gccattgctt cttcccccga ccgctgtata acgctggaaa agtccaccca 21960 
aagcgtacag gggcccaact cggccgcctg tggactattc tgctgcatgt ttctccacgc 22020 
ctttgccaac tggccccaaa ctcccatgga tcacaacccc accatgaacc ttattaccgg 22080 
ggtacccaac tccatgctca acagtcccca ggtacagccc accctgcgtc gcaaccagga 22140 
acagctctac agcttcctgg agcgccactc gccctacttc cgcagccaca gtgcgcagat 22200 
taggagcgcc acttcttttt gtcacttgaa aaacatgtaa aaataatgta ctagagacac 22260 
tttcaataaa ggcaaatgct tttatttgta cactctcggg tgattattta cccccaccct 22320 
tgccgtctgc gccgtttaaa aatcaaaggg gttctgccgc gcatcgctat gcgccactgg 22380 
cagggacacg ttgcgatact ggtgtttagt gctccactta aactcaggca caaccatccg 22440 
cggcagctcg gtgaagtttt cactccacag gctgcgcacc atcaccaacg cgtttagcag 22500 
gtcgggcgcc gatatcttga agtcgcagtt ggggcctccg ccctgcgcgc gcgagttgcg 22560 
atacacaggg ttgcagcact ggaacactat cagcgccggg tggtgcacgc tggccagcac 22620 
gctcttgtcg gagatcagat ccgcgtccag gtcctccgcg ttgctcaggg cgaacggagt 22680 
caactttggt agctgccttc ccaaaaaggg cgcgtgccca ggctttgagt tgcactcgca 22740 
ccgtagtggc atcaaaaggt gaccgtgccc ggtctgggcg ttaggataca gcgcctgcat 22800 
aaaagccttg atctgcttaa aagccacctg agcctttgcg ccttcagaga agaacatgcc 22860 
gcaagacttg ccggaaaact gattggccgg acaggccgcg tcgtgcacgc agcaccttgc 22920 
gtcggtgttg gagatctgca ccacatttcg gccccaccgg ttcttcacga tcttggcctt 22980 
gctagactgc tccttcagcg cgcgctgccc gttttcgctc gtcacatcca tttcaatcac 23040 
gtgctcctta tttatcataa tgcttcbgtg tagacactta agctcgcctt cgatctcagc 23100 
gcagcggtgc agccacaacg cgcagcccgt gggctcgtga tgcttgtagg tcacctctgc 23160 
aaacgactgc aggtacgcct gcaggaatcg ccccatcatc gtcacaaagg tcttgttgct 23220 
ggtgaaggtc agctgcaacc cgcggtgctc ctcgttcagc caggtcttgc atacggccgc 23280 
cagagcttcc acttggtcag gcagtagttt gaagttcgcc tttagatcgt tatccacgtg 23340 
gtacttgtcc atcagcgcgc gcgcagcctc catgcccttc tcccacgcag acacgatcgg 23400 
cacactcagc gggttcatca ccgtaatttc actttccgct tcgctgggct cttcctcttc 23460 
ctcttgcgtc cgcataccac gcgccactgg gtcgtcttca ttcagccgcc gcactgtgcg 23520 
cttacctcct ttgccatgct tgattagcac cggtgggttg ctgaaaccca ccatttgtag 23580 
cgccacatct tctctttctt cctcgctgtc cacgattacc tctggtgatg gcgggcgctc 23640 
gggcttggga gaagggcgct tctttttctt cttgggcgca atggccaaat ccgccgccga 23700 
ggtcgatggc cgcgggctgg gtgtgcgcgg caccagcgcg tcttgtgatg agtcttcctc 23760 
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gtcctcggac tcgatacgcc gcctcatccg cttttttggg ggcgcccggg gaggcggcgg 23820 
cgacggggac ggggacgaca cgtcctccat ggttggggga cgtcgcgccg caccgcgtcc 238B0 
gcgctcgggg gtggtttcgc gctgctcctc ttcccgactg gccatttcct tctcctatag 23940 
gcagaaaaag atcatggagt cagtcgagaa gaaggacagc ctaaccgccc cctctgagtt 24000 
cgccaccacc gcctccaccg atgccgccaa cgcgcctacc accttccccg tcgaggcacc 24060 
cccgcttgag gaggaggaag tgattatcga gcaggaccca ggttttgtaa gcgaagacga 24120 
cgaggaccgc tcagtaccaa cagaggataa aaagcaagac caggacaacg cagaggcaaa 241B0 
cgaggaacaa gtcgggcggg gggacgaaag gcatggcgac tacctagatg tgggagacga 24240 
cgtgctgttg aagcatctgc agcgccagtg cgccattatc tgcgacgcgt tgcaagagcg 24300 
cagcgatgtg cccctcgcca tagcggatgt cagccttgcc tacgaacgcc acctattctc 24360 
accgcgcgta ccccccaaac gccaagaaaa cggcacatgc gagcccaacc cgcgcctcaa 24420 
cttctacccc gtatttgccg tgccagaggt gcttgccacc tatcacatct ttttccaaaa 24480 
ctgcaagata cccctatcct gccgtgccaa ccgcagccga gcggacaagc agctggcctt 24540 
gcggcagggc gctgtcatac ctgatatcgc ctcgctcaac gaagtgccaa aaatctttga 24 600 
gggtcttgga cgcgacgaga agcgcgcggc aaacgctctg caacaggaaa acagcgaaaa 24 660 
tgaaagtcac tctggagtgt tggtggaact cgagggtgac aacgcgcgcc tagccgtact 24720 
aaaacgcagc atcgaggtca cccactttgc ctacccggca cttaacctac cccccaaggt 24780 
catgagcaca gtcatgagtg agctgatcgt gcgccgtgcg cagcccctgg agagggatgc 24840 
aaatttgcaa gaacaaacag aggagggcct acccgcagtt ggcgacgagc agctagcgcg 24 900 
ctggcttcaa acgcgcgagc ctgccgactt ggaggagcga cgcaaactaa tgatggccgc 24960 
agtgctcgtt accgtggagc ttgagtgcat gcagcggttc tttgctgacc cggagatgca 25020 
gcgcaagcta gaggaaacat tgcactacac ctttcgacag ggctacgtac gccaggcctg 25080 
caagatctcc aacgtggagc tctgcaacct ggtctcctac cttggaattt tgcacgaaaa 25140 
ccgccttggg caaaacgtgc ttcattccac gctcaagggc gaggcgcgcc gcgactacgt 25200 
ccgcgactgc gtttacttat ttctatgcta cacctggcag acggccatgg gcgtttggca 25260 
gcagtgcttg gaggagtgca acctcaagga gctgcagaaa ctgctaaagc aaaacttgaa 25320 
ggacctatgg acggccttca acgagcgctc cgtggccgcg cacctggcgg acatcatttt 25380 
ccccgaacgc ctgcttaaaa ccctgcaaca gggtctgcca gacttcacca gtcaaagcat 25440 
gttgcagaac tttaggaact ttatcctaga gcgctcagga atcttgcccg ccacctgctg 25500 
tgcacttcct agcgactttg tgcccattaa gtaccgcgaa tgccctccgc cgctttgggg 25560 
ccactgctac cttctgcagc tagccaacta ccttgcctac cactctgaca taatggaaga 25620 
cgtgagcggt gacggtctac tggagtgtca ctgtcgctgc aacctatgca ccccgcaccg 25680 
ctccctggtt tgcaattcgc agctgcttaa cgaaagtcaa attatcggta cctttgagct 25740 
gcagggtccc. tcgcctgacg aaaagtccgc ggctccgggg ttgaaactca ctccggggct 25800 
gtggacgtcg gcttaccttc gcaaatttgt acctgaggac taccacgccc acgagattag 25860 
gttctacgaa gaccaatccc gcccgccaaa tgcggagctt accgcctgcg tcattaccca 25920 
gggccacatt cttggccaat tgcaagccat caacaaagcc cgccaagagt ttctgctacg 25980 
aaagggacgg ggggtttact tggaccccca gtccggcgag gagctcaacc caatcccccc 26040 
gccgccgcag ccctatcagc agcagccgcg ggcccttgct tcccaggatg gcacccaaaa 26100 
agaagctgca gctgccgccg ccacccacgg acgaggagga atactgggac agtcaggcag 26160 
aggaggtttt ggacgaggag gaggaggaca tgatggaaga ctgggagagc ctagacgagg 26220 
aagcttccga ggtcgaagag gtgtcagacg aaacaccgtc accctcggtc gcattcccct 26280 
cgccggcgcc ccagaaatcg gcaaccggtt ccagcatggc tacaacctcc gctcctcagg 26340 
cgccgccggc actgcccgtt cgccgaccca accgtagatg ggacaccact ggaaccaggg 26400 
ccggtaagtc caagcagccg ccgccgttag cccaagagca acaacagcgc caaggctacc 26460 
gctcatggcg cgggcacaag aacgccatag ttgcttgctt gcaagactgt gggggcaaca 26520 
tctccttcgc ccgccgcttt cttctctacc atcacggcgt ggccttcccc cgtaacatcc 26580 
tgcattacta ccgtcatctc tacagcccat actgcaccgg cggcagcggc agcggcagca 26640 
acagcagcgg ccacacagaa gcaaaggcga ccggatagca agactctgac aaagcccaag 26700 
aaatccacag cggcggcagc agcaggagga ggagcgctgc gtctggcgcc caacgaaccc 26760 
gtatcgaccc gcgagcttag aaacaggatt tttcccactc tgtatgctat atttcaacag 26820 
agcaggggcc aagaacaaga gctgaaaata aaaaacaggt ctctgcgatc cctcacccgc 26880 
agctgcctgt atcacaaaag cgaagatcag cttcggcgca cgctggaaga cgcggaggct 26940 
ctcttcagta aatactgcgc gctgactctt aaggactagt ttcgcgccct ttctcaaatt 27000 
taagcgcgaa aactacgtca tctccagcgg ccacacccgg cgccagcacc tgtcgtcagc 27060 
gccattatga gcaaggaaat tcccacgccc tacatgtgga gttaccagcc acaaatggga 27120 
cttgcggctg gagctgccca agactactca acccgaataa actacatgag cgcgggaccc 27180 
cacatgatat ' cccgggtcaa cggaatccgc gcccaccgaa accgaattct cttggaacag 27240 
gcggctatta ccaccacacc tcgtaataac cttaatcccc gtagttggcc cgctgccctg 27300 
gtgtaccagg aaagtcccgc tcccaccact gtggtacttc ccagagacgc ccaggccgaa 27360 
gttcagatga ctaactcagg ggcgcagctt gcgggcggct ttcgtcacag ggtgcggtcg 27420 
cccgggcagg gtataactca cctgacaatc agagggcgag gtattcagct caacgacgag 27480 
tcggtgagct cctcgcttgg tctccgtccg gacgggacat ttcagatcgg cggcgccggc 27540 
cgtccttcat tcacgcctcg tcaggcaatc ctaactctgc agacctcgtc ctctgagccg 27600 
cgctctggag gcattggaac tctgcaattt attgaggagt ttgtgccatc ggtctacttt 27660 
aaccccttct cgggacctcc cggccactat ccggatcaat ttattcctaa ctttgacgcg 27720 
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gtaaaggact cggcggacgg ctacgactga atgttaagtg gagaggcaga gcaactgcgc 27780 
ctgaaacacc tggtccactg tcgccgccac aagtgctttg cccgcgactc cggtgagttt 27840 
tgctactttg aattgcccga ggatcatatc gagggcccgg cgcacggcgt ccggcttacc 27900 
gcccagggag agcttgcccg tagcctgatt cgggagttta cccagcgccc cctgctagtt 27960 
gagcgggaca ggggaccctg tgttctcact gtgatttgca actgtcctaa ccttggatta 28020 
catcaagatc tttgttgcca tctctgtgct gagtataata aatacagaaa ttaaaatata 28080 
ctggggctcc tatcgccatc ctgtaaacgc caccgtcttc acccgcccaa gcaaaccaag 28140 
gcgaacctta cctggtactt ttaacatctc tccctctgtg atttacaaca gtttcaaccc 28200 
agacggagtg agtctacgag agaacctctc cgagctcagc tactccatca gaaaaaacac 28260 
caccctcctt acctgccggg aacgtacgag tgcgtcaccg gccgctgcac cacacctacc 28320 
gcctgaccgt aaaccagact ttttccggac agacctcaat aactctgttt accagaacag 28380 
gaggtgagct tagaaaaccc ttagggtatt aggccaaagg cgcagctact gtggggttta 28440 
tgaacaattc aagcaactct acgggctatt ctaattcagg tttctctaga agtcaggctt 28500 
cctggatgtc agcatctgac tttggccagc acctgtcccg cggatttgtt ccagtccaac 28560 
tacagcgacc caccctaaca gagatgacca acacaaccaa cgcggccgcc gctaccggac 28620 
ttacatctac cacaaataca ccccaagttt ctgcctttgt caataactgg gataacttgg 28680 
gcatgtggtg gttctccata gcgcttatgt ttgtatgcct tattattatg tggctcatct 28740 
gctgcctaaa gcgcaaacgc gcccgaccac ccatctatag tcccatcatt gtgctacacc 28800 
caaacaatga tggaatccat agattggacg gactgaaaca catgttcttt tctcttacag 28860 
tatgattaaa tgagatctag aaatggacgg aattattaca gagcagcgcc tgctagaaag 28920 
acgcagggca gcggccgagc aacagcgcat gaatcaagag ctccaagaca tggttaactt 28980 
gcaccagtgc aaaaggggta tcttttgtct ggtaaagcag gccaaagtca cctacgacag 29040 
taataccacc ggacaccgcc ttagctacaa gttgccaacc aagcgtcaga aattggtggt 29100 
catggtggga gaaaagccca ttaccataac tcagcactcg gtagaaaccg aaggctgcat 29160 
tcactcacct tgtcaaggac ctgaggatct ctgcaccctt attaagaccc tgtgcggtct 29220 
caaagatctt attcccttta actaataaaa aaaaataata aagcatcact tacttaaaat 29280 
cagttagcaa atttctgtcc agtttattca gcagcacctc cttgccctcc tcccagctct 29340 
ggtattgcag cttcctcctg gctgcaaact ttctccacaa tctaaatgga atgtcagttt 29400 
cctcctgttc ctgtccatcc gcacccacta tcttcatgtt gttgcagatg aagcgcgcaa 294 60 
gaccgtctga agataccttc aaccccgtgt atccatatga cacggaaacc ggtcctccaa 29520 
ctgtgccttt tcttactcct ccctttgtat cccccaatgg gtttcaagag agtccccctg 29580 
gggtactctc tttgcgccta tccgaacctc tagttacctc caatggcatg cttgcgctca 29640 
aaatgggcaa cggcctctct ctggacgagg ccggcaacct tacctcccaa aatgtaacca 29700 
ctgtgagccc acctctcaaa aaaaccaagt caaacataaa cctggaaata tctgcacccc 29760 
tcacagttac ctcagaagcc ctaactgtgg ctgccgccgc acctctaatg gtcgcgggca 29820 
acacactcac catgcaatca caggccccgc taaccgtgca cgactccaaa cttagcattg 29880 
ccacccaagg acccctcaca gtgtcagaag gaaagctagc cctgcaaaca tcaggccccc 29940 
tcaccaccac cgatagcagt acccttacta tcactgcctc accccctcta actactgcca 30000 
ctggtagctt gggcattgac ttgaaagagc ccatttatac acaaaatgga aaactaggac 30060 
taaagtacgg ggctcctttg catgtaacag acgacctaaa cactttgacc gtagcaactg 30120 
gtccaggtgt gactattaat aatacttcct tgcaaactaa agttactgga gccttgggtt 30180 
ttgattcaca aggcaatatg caacttaatg tagcaggagg actaaggatt gattctcaaa 30240 
acagacgcct tatacttgat gttagttatc cgtttgatgc tcaaaaccaa ctaaatctaa 30300 
gactaggaca gggccctctt tttataaact cagcccacaa cttggatatt aactacaaca 30360 
aaggccttta cttgtttaca gcttcaaaca attccaaaaa gcttgaggtt aacctaagca 30420 
ctgccaaggg gttgatgttt gacgctacag ccatagccat taatgcagga gatgggcttg 30480 
aatttggttc acctaatgca ccaaacacaa atcccctcaa aacaaaaatt ggccatggcc 30540 
tagaatttga ttcaaacaag gctatggttc ctaaactagg aactggcctt agttttgaca 30600 
gcacaggtgc cattacagta ggaaacaaaa ataatgataa gctaactttg tggaccacac 30660 
cagctccatc tcctaactgt agactaaatg cagagaaaga tgctaaactc actttggtct 30720 
taacaaaatg tggcagtcaa atacttgcta cagtttcagt tttggctgtt aaaggcagtt 30780 
tggctccaat atctggaaca gttcaaagtg ctcatcttat tataagattt gacgaaaatg 30840 
gagtgctact aaacaattcc ttcctggacc cagaatattg gaactttaga aatggagatc 30900 
ttactgaagg cacagcctat acaaacgctg ttggatttat gcctaaccta tcagcttatc 30960 
caaaatctca cggtaaaact gccaaaagta acattgtcag tcaagtttac ttaaacggag 31020 
acaaaactaa acctgtaaca ctaaccatta cactaaacgg tacacaggaa acaggagaca 31080 
caactccaag tgcatactct atgtcatttt catgggactg gtctggccac aactacatta 31140 
atgaaatatt tgccacatcc tcttacactt tttcatacat tgcccaagaa taaagaatcg 31200 
tttgtgttat gtttcaacgt gtttattttt caattgcaga aaatttcaag tcatttttca 31260 
ttcagtagta tagccccacc accacatagc ttatacagat caccgtacct taatcaaact 31320 
cacagaaccc tagtattcaa cctgccacct ccctcccaac acacagagta cacagtcctt 31380 
tctccccggc tggccttaaa aagcatcata tcatgggtaa cagacatatt cttaggtgtt 314 40 
atattccaca cggtttcctg tcgagccaaa cgctcatcag tgatattaat aaactccccg 31500 
ggcagctcac ttaagttcat gtcgctgtcc agctgctgag ccacaggctg ctgtccaact 31560 
tgcggttgct taacgggcgg cgaaggagaa gtccacgcct acatgggggt agagtcataa 31620 
tcgtgcatca ggatagggcg gtggtgctgc agcagcgcgc gaataaactg ctgccgccgc 31680 
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cgctccgtcc tgcaggaata caacatggca gtggtctcct cagcgatgat tcgcaccgcc 31740 

cgcagcataa ggcgccttgt cctccgggca cagcagcgca ccctgatctc acttaaatca 31800 

gcacagtaac tgcagcacag caccacaata ttgttcaaaa tcccacagtg caaggcgctg 31860 

tatccaaagc tcatggcggg gaccacagaa cccacgtggc catcatacca caagcgcagg 31920 

tagattaagt ggcgacccct cataaacacg ctggacataa acattacctc ttttggcatg 31980 

ttgtaattca ccacctcccg gtaccatata aacctctgat taaacatggc gccatccacc 32040 

accatcctaa accagctggc caaaacctgc ccgccggcta tacactgcag ggaaccggga 32100 

ctggaacaat gacagtggag agcccaggac tcgtaaccat ggatcatcat gctcgtcatg 32160 

atatcaatgt tggcacaaca caggcacacg tgcatacact tcctcaggat tacaagctcc 32220 

tcccgcgtta gaaccatatc ccagggaaca acccattcct gaatcagcgt aaatcccaca 32280 

ctgcagggaa gacctcgcac gtaactcacg ttgtgcattg tcaaagtgtt acattcgggc 32340 

agcagcggat gatcctccag tatggtagcg cgggtttctg tctcaaaagg aggtagacga 32400 

tccctactgt acggagtgcg ccgagacaac cgagatcgtg ttggtcgtag tgtcatgcca 324 60 

aatggaacgc cggacgtagt catatttcct gaagcaaaac caggtgcggg cgtgacaaac 32520 

agatctgcgt ctccggtctc gccgcttaga tcgctctgtg tagtagttgt agtatatcca 32580 

ctctctcaaa gcatccaggc gccccctggc ttcgggttct atgtaaactc cttcatgcgc 32640 

cgctgccctg ataacatcca ccaccgcaga ataagccaca cccagccaac ctacacattc 32700 

gttctgcgag tcacacacgg gaggagcggg aagagctgga agaaccatgt tttttttttt 32760 

attccaaaag attatccaaa acctcaaaat gaagatctat taagtgaacg cgctcccctc 32820 

cggtggcgtg gtcaaactct acagccaaag aacagataat ggcatttgta agatgttgca 32880 

caatggcttc caaaaggcaa acggccctca cgtccaagtg gacgtaaagg ctaaaccctt 32940 

cagggtgaat ctcctctata aacattccag caccttcaac catgcccaaa taattctcat 33000 

ctcgccacct tctcaatata tctctaagca aatcccgaat attaagtccg gccattgtaa 33060 

aaatctgctc cagagcgccc tccaccttca gcctcaagca gcgaatcatg attgcaaaaa 33120 

ttcaggttcc tcacagacct gtataagatt caaaagcgga acattaacaa aaataccgcg 33180 

atcccgtagg tcccttcgca gggccagctg aacataatcg tgcaggtctg cacggaccag 33240 

cgcggccact tccccgccag gaaccttgac aaaagaaccc acactgatta tgacacgcat 33300 

actcggagct atgctaacca gcgtagcccc gatgtaagct ttgttgcatg ggcggcgata 33360 

taaaatgcaa ggtgctgctc aaaaaatcag gcaaagcctc gcgcaaaaaa gaaagcacat 33420 

cgtagtcatg ctcatgcaga taaaggcagg taagctccgg aaccaccaca gaaaaagaca 33480 

ccatttttct ctcaaacatg tctgcgggtt tctgcataaa cacaaaataa aataacaaaa 33540 

aaacatttaa acattagaag cctgtcttac aacaggaaaa acaaccctta taagcataag 33600 

acggactacg gccatgccgg cgtgaccgta aaaaaactgg tcaccgtgat taaaaagcac 33660 

caccgacagc tcctcggtca tgtccggagt cataatgtaa gactcggtaa acacatcagg 33720 

ttgattcatc ggtcagtgct aaaaagcgac cgaaatagcc cgggggaata catacccgca 33780 

ggcgtagaga caacattaca gcccccatag gaggtataac aaaattaata ggagagaaaa 33840 

acacataaac acctgaaaaa ccctcctgcc taggcaaaat agcaccctcc cgctccagaa 33900 

caacatacag cgcttcacag cggcagccta acagtcagcc ttaccagtaa aaaagaaaac 33960 

ctattaaaaa aacaccactc gacacggcac cagctcaatc agtcacagtg taaaaaaggg 34020 

ccaagtgcag agcgagtata tataggacta aaaaatgacg taacggttaa agtccacaaa 34080 

aaacacccag aaaaccgcac gcgaacctac gcccagaaac gaaagccaaa aaacccacaa 34140 

cttcctcaaa tcgtcacttc cgttttccca cgttacgtaa cttcccattt taagaaaact 34200 

acaattccca acacatacaa gttactccgc cctaaaacct acgtcacccg ccccgttccc 34260 

acgccccgcg ccacgtcaca aactccaccc cctcattatc atattggctt caatccaaaa 34320 
taaggtatat tattgatgat g 34341 

<210> 3 
<211> 33699 
<212> DNA 

<213> Adenovirus subgroup C 
<400> 3 

catcatcaat aatatacctt attttggatt gaagccaata tgataatgag ggggtggagt 60 

ttgtgacgtg gcgcggggcg tgggaacggg gcgggtgacg tagtagtgtg gcggaagtgt 120 

gatgttgcaa gtgtggcgga acacatgtaa gcgacggatg tggcaaaagt gacgtttttg 180 

gtgtgcgccg gtgtacacag gaagtgacaa ttttcgcgcg gttttaggcg gatgttgtag 240 

taaatttggg cgtaaccgag taagatttgg ccattttcgc gggaaaactg aataagagga 300 

agtgaaatct gaataatttt gtgttactca tagcgcgtaa tatttgtcta gggccgcggg 360 

gactttgacc gtttacgtgg agactcgccc aggtgttttt ctcaggtgtt ttccgcgttc 420 

cgggtcaaag ttggcgtttt attattatag tcagctgacg tgtagtgtat ttatacccgg 480 

tgagttcctc aagaggccac tcttgagtgc cagcgagtag agttttctcc tccgagccgc 540 

tccgacaccg ggactgaaaa tgagacatat tatctgccac ggaggtgtta ttaccgaaga 600 

aatggccgcc agtcttttgg accagctgat cgaagaggta ctggctgata atcttccacc 660 

tcctagccat tttgaaccac ctacccttca cgaactgtat gatttagacg tgacggcccc 720 

cgaagatccc aacgaggagg cggtttcgca gatttttccc gactctgtaa tgttggcggt 780 

gcaggaaggg attgacttac tcacttttcc gccggcgccc ggttctccgg agccgcctca 840 
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cctttcccgg cagcccgagc agccggagca gagagccttg ggtccggttt ctatgccaaa 900 
ccttgtaccg gaggtgatcg atcttacctg ccacgaggct ggctttccac ccagtgacga 960 
cgaggatgaa gagggtgagg agtttgtgtt agattatgtg gagcaccccg ggcacggttg 1020 
caggtcttgt cattatcacc ggaggaatac gggggaccca gatattatgt gttcgctttg 1080 
ctatatgagg acctgtggca tgtttgtcta cagtaagtga aaattatggg cagtgggtga 1140 
tagagtggtg ggtttggtgt ggtaattttt tttttaattt ttacagtttt gtggtttaaa 1200 
gaattttgta ttgtgatttt tttaaaaggt cctgtgtctg aacctgagcc tgagcccgag 1260 
ccagaaccgg agcctgcaag acctacccgc cgtcctaaaa tggcgcctgc tatcctgaga 1320 
cgcccgacat cacctgtgtc tagagaatgc aatagtagta cggatagctg tgactccggt 1380 
ccttctaaca cacctcctga gatacacccg gtggtcccgc tgtgccccat taaaccagtt 1440 
gccgtgagag ttggtgggcg tcgccaggct gtggaatgta tcgaggactt gcttaacgag 1500 
cctgggcaac ctttggactt gagctgtaaa cgccccaggc cataaggtgt aaacctgtga 1560 
ttgcgtgtgt ggttaacgcc tttgtttgct gaatgagttg atgtaagttt aataaagggt 1620 
gagataatgt ttaacttgca tggcgtgtta aatggggcgg ggcttaaagg gtatataatg 1680 
cgccgtgggc taatcttggt tacatctgac ctcatggagg cttgggagtg tttggaagat 1740 
ttttctgctg tgcgtaactt gctggaacag agctctaaca gtacctcttg gttttggagg 1800 
tttctgtggg gctcatccca ggcaaagtta gtctgcagaa ttaaggagga ttacaagtgg 1860 
gaatttgaag agcttttgaa atcctgtggt gagctgtttg attctttgaa tctgggtcac 1920 
caggcgcttt tccaagagaa ggtcatcaag actttggatt tttccacacc ggggcgcgct 1980 
gcggctgctg ttgctttttt gagttttata aaggataaat ggagcgaaga aacccatctg 2040 
agcggggggt acctgctgga ttttctggcc atgcatctgt ggagagcggt tgtgagacac 2100 
aagaatcgcc tgctactgtt gtcttccgtc cgcccggcga taataccgac ggaggagcag 2160 
cagcagcagc aggaggaagc caggcggcgg cggcaggagc agagcccatg gaacccgaga 2220 
gccggcctgg accctcggga atgaatgttg tacaggtggc tgaactgtat ccagaactga 2280 
gacgcatttt gacaattaca gaggatgggc aggggctaaa gggggtaaag agggagcggg 2340 
'gggcttgtga ggctacagag gaggctagga atctagcttt tagcttaatg accagacacc 2400 
gtcctgagtg tattactttt caacagatca aggataattg cgctaatgag cttgatctgc 2460 
tggcgcagaa gtattccata gagcagctga ccacttactg gctgcagcca ggggatgatt 2520 
ttgaggaggc tattagggta tatgcaaagg tggcacttag gccagattgc aagtacaaga 2580 
tcagcaaact tgtaaatatc aggaattgtt gctacatttc tgggaacggg gccgaggtgg 2640 
agatagatac ggaggatagg gtggccttta gatgtagcat gataaatatg tggccggggg 2700 
tgcttggcat ggacggggtg gttattatga atgtaaggtt tactggcccc aattttagcg 2760 
gtacggtttt cctggccaat accaacctta tcctacacgg tgtaagcttc tatgggttta 2820 
acaatacctg tgtggaagcc tggaccgatg taagggttcg gggctgtgcc ttttactgct 2880 
gctggaaggg ggtggtgtgt cgccccaaaa gcagggcttc aattaagaaa tgcctctttg 2940 
aaaggtgtac cttgggtatc ctgtctgagg gtaactccag ggtgcgccac aatgtggcct 3000 
ccgactgtgg ttgcttcatg ctagtgaaaa gcgtggctgt gattaagcat aacatggtat 3060 
gtggcaactg cgaggacagg gcctctcaga tgctgacctg ctcggacggc aactgtcacc 3120 
tgctgaagac cattcacgta gccagccact ctcgcaaggc ctggccagtg tttgagcata 3180 
acatactgac ccgctgttcc ttgcatttgg gtaacaggag gggggtgttc ctaccttacc 3240 
aatgcaattt gagtcacact aagatattgc ttgagcccga gagcatgtcc aaggtgaacc 3300 
tgaacggggt gtttgacatg accatgaaga tctggaaggt gctgaggtac gatgagaccc 3360 
gcaccaggtg cagaccctgc gagtgtggcg gtaaacatat taggaaccag cctgtgatgc 3420 
tggatgtgac cgaggagctg aggcccgatc acttggtgct ggcctgcacc cgcgctgagt 3480 
ttggctctag cgatgaagat acagattgag gtactgaaat gtgtgggcgt ggcttaaggg 3540 
tgggaaagaa tatataaggt gggggtctta tgtagttttg tatctgtttt gcagcagccg 3600 
ccgccgccat gagcaccaac tcgtttgatg gaagcattgt gagctcatat ttgacaacgc 3660 
gcatgccccc atgggccggg gtgcgtcaga atgtgatggg ctccagcatt gatggtcgcc 3720 
ccgtcctgcc cgcaaactct actaccttga cctacgagac cgtgtctgga acgccgttgg 3780 
agactgcagc ctccgccgcc gcttcagccg ctgcagccac cgcccgcggg attgtgactg 3840 
actttgcttt cctgagcccg cttgcaagca gtgcagcttc ccgttcatcc gcccgcgatg 3900 
acaagttgac ggctcttttg gcacaattgg attctttgac ccgggaactt aatgtcgttt 3960 
ctcagcagct gttggatctg cgccagcagg tttctgccct gaaggcttcc tcccctccca 4020 
atgcggttta aaacataaat aaaaaaccag actctgtttg gatttggatc aagcaagtgt 4080 
cttgctgtct ttatttaggg gttttgcgcg cgcggtaggc ccgggaccag cggtctcggt 4140 
cgttgagggt cctgtgtatt ttttccagga cgtggtaaag gtgactctgg atgttcagat 4200 
acatgggcat aagcccgtct ctggggtgga ggtagcacca ctgcagagct tcatgctgcg 4260 
gggtggtgtt gtagatgatc cagtcgtagc aggagcgctg ggcgtggtgc ctaaaaatgt 4320 
ctttcagtag caagctgatt gccaggggca ggcccttggt gtaagtgttt acaaagcggt 4380 
taagctggga tgggtgcata cgtggggata tgagatgcat cttggactgt atttttaggt 4440 
tggctatgtt cccagccata tccctccggg gattcatgtt gtgcagaacc accagcacag 4500 
tgtatccggt gcacttggga aatttgtcat gtagcttaga aggaaatgcg tggaagaact 4560 
tggagacgcc cttgtgacct ccaagatttt ccatgcattc gtccataatg atggcaatgg 4620 
gcccacgggc ggcggcctgg gcgaagatat ttctgggatc actaacgtca tagttgtgtt 4680 
ccaggatgag atcgtcatag gccattttta caaagcgcgg gcggagggtg ccagactgcg 4740 
gtataatggt tccatccggc ccaggggcgt agttaccctc acagatttgc atttcccacg 4800 
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ctttgagttc agatgggggg atcatgtcta 
gggtagggga gatcagctgg gaagaaagca 
cggtgggccc gtaaatcaca cctattaccg 
tgccgtcatc cctgagcagg ggggccactt 
ccctgaccaa atccgccaga aggcgctcgc 
caaagttttt caacggtttg agaccgtccg 
gcagttccag gcggtcccac agctcggtca 
ctcctcgttt cgcgggttgg ggcggctttc 
acgggccagg gtcatgtctt tccacgggcg 
ggtgaagggg tgcgctccgg gctgcgcgct 
ggtgctgaag cgctgccggt cttcgccctg 
gtcatagtcc agcccctccg cggcgtggcc 
gccgcacgag gggcagtgca gacttttgag 
ttccggggag taggcatccg cgccgcaggc 
ggtgagctct ggccgttcgg ggtcaaaaac 
cttacctctg gtttccatga gccggtgtcc 
cccgtataca gacttgagag gcctgtcctc 
aaactcggac cactctgaga caaaggctcg 
ggaggggtag cggtcgttgt ccactagggg 
gtcgccctct tcggcatcaa ggaaggtgat 
tgttcctgaa ggggggctat aaaagggggt 
atcgctgtct gcgagggcca gctgttgggg 
ttctgcgcta agattgtcag tttccaaaaa 
ggtgatgcct ttgagggtgg ccgcatccat 
aagcttggtg gcaaacgacc cgtagagggc 
ggtttggttt ttgtcgcgat cggcgcgctc 
gcgcgcaacg caccgccatt cgggaaagac 
gcgccaaccg cggttgtgca gggtgacaag 
gcgctcgttg gtccagcaga ggcggccgcc 
tagctgcgtc tcgtccgggg ggtctgcgtc 
gtcgaagtag tctatcttgc atccttgcaa 
aagcgcgcgc tcgtatgggt tgagtggggg 
ggcgtacatg ccgcaaatgt cgtaaacgta 
agggtagcat cttccaccgc ggatgctggc 
agcgaggagg tcgggaccga ggttgctacg 
cctgaagatg gcatgtgagt tggatgatat 
gtctgtgaga cctaccgcgt cacgcacgaa 
cagctcggcg gtgacctgca cgtctagggc 
atacttatcc tgtccctttt ttttccacag 
tttccagtac tcttggatcg gaaacccgtc 
gaactggttg acggcctggt aggcgcagca 
cgcggccttc cggagcgagg tgtgggtgag 
gtactggtat ttgaagtcag tgtcgtcgca 
gcgctttttg gaacgcggat ttggcagggc 
cgcgcgaggc ataaagttgc gtgtgatgcg 
aattacctgg gcggcgagca cgatctcgtc 
aagttccaag aagcgcggga tgcccttgat 
gagctcttca ggggagctga gcccgtgctc 
ggaagcgacg aatgagctcc acaggtcacg 
ggtcctaaac tggcgaccta tggccatttt 
gtcttgttcc cagcggtccc atccaaggtt 
aggctcatct ccgccgaact tcatgaccag 
ccccatccaa gtataggtct ctacatcgta 
cgagccgatc gggaagaact ggatctcccg 
gtgaaagtag aagtccctgc gacgggccga 
gcagtactgg cagcggtgca cgggctgtac 
cacaaggaag cagagtggga atttgagccc 
tacttcggct gcttgtcctt gaccgtctgg 
caccacgccg cgcgagccca aagtccagat 
aacatcgcgc agatgggagc tgtccatggt 
gagctcctgc aggtttacct cgcatagacg 
cctaatttcc aggggctggt tggtggcggc 
cggcgcgact acggtaccgc gcggcgggcg 
atctaaaagc ggtgacgcgg gcgagccccc 
agagggggca ggggcacgtc ggcgccgcgc 
ttgctggcga acgcgacgac gcggcggttg 



cctgcggggc gatgaagaaa acggtttccg 4860 
ggttcctgag cagctgcgac ttaccgcagc 4920 
ggtgcaactg gtagttaaga gagctgcagc 4 980 
cgttaagcat gtccctgact cgcatgtttt 504 0 
cgcccagcga tagcagttct tgcaaggaag 5100 
ccgtaggcat gcttttgagc gtttgaccaa 5160 
cctgctctac ggcatctcga tccagcatat 5220 
gctgtacggc agtagtcggt gctcgtccag 5280 
cagggtcctc gtcagcgtag tctgggtcac 5340 
ggccagggtg cgcttgaggc tggtcctgct 5400 
cgcgtcggcc aggtagcatt tgaccatggt 54 60 
cttggcgcgc agcttgccct tggaggaggc 5520 
ggcgtagagc ttgggcgcga gaaataccga 5580 
cccgcagacg gtctcgcatt ccacgagcca 5640 
caggtttccc ccatgctttt tgatgcgttt 5700 
acgctcggtg acgaaaaggc tgtccgtgtc 5760 
gagcggtgtt ccgcggtcct cctcgtatag 5820 
cgtccaggcc agcacgaagg aggctaagtg 5880 
gtccactcgc tccagggtgt gaagacacat 5940 
tggtttgtag gtgtaggcca cgtgaccggg 6000 
gggggcgcgt tcgtcctcac tctcttccgc 6060 
tgagtactcc ctctgaaaag cgggcatgac 6120 
cgaggaggat ttgatattca cctggcccgc 6180 
ctggtcagaa aagacaatct ttttgttgtc 6240 
gttggacagc aacttggcga tggagcgcag 6300 
cttggccgcg atgtttagct gcacgtattc 6360 
ggtggtgcgc tcgtcgggca ccaggtgcac 6420 
gtcaacgctg gtgtfctacct ctccgcgtag 6480 
cttgcgcgag cagaatggcg gtagggggtc 6540 
cacggtaaag accccgggca gcaggcgcgc 6600 
gtctagcgcc tgctgccatg cgcgggcggc 6660 
accccatggc atggggtggg tgagcgcgga 6720 
gaggggctct ctgagtattc caagatatgt 6780 
gcgcacgtaa tcgtatagtt cgtgcgaggg 6840 
ggcgggctgc tctgctcgga agactatctg 6900 
ggttggacgc tggaagacgt tgaagctggc 6960 
ggaggcgtag gagtcgcgca gcttgttgac 7020 
gcagtagtcc agggtttcct tgatgatgtc 7080 
ctcgcggttg aggacaaact cttcgcggtc 7140 
ggcctccgaa cggtaagagc ctagcatgta 7200 
tcccttttct acgggtagcg cgtatgcctg 7260 
cgcaaaggtg tccctgacca tgactttgag 7320 
tccgccctgc tcccagagca aaaagtccgt 7380 
gaaggtgaca tcgttgaaga gtatctttcc 7440 
gaagggtccc ggcacctcgg aacggttgtt 7500 
aaagccgttg atgttgtggc ccacaatgta 7560 
ggaaggcaat tttttaagtt cctcgtaggt 7620 
tgaaagggcc cagtctgcaa gatgagggtt 7680 
ggccattagc atttgcaggt ggtcgcgaaa 7740 
ttctggggtg atgcagtaga aggtaagcgg 7800 
cgcggctagg tctcgcgcgg cagtcactag 7860 
catgaagggc acgagctgct tcccaaaggc 7920 
ggtgacaaag agacgctcgg tgcgaggatg 7980 
ccaccaattg gaggagtggc tattgatgtg 8040 
acactcgtgc tggcttttgt aaaaacgtgc 8100 
atcctgcacg aggttgacct gacgaccgcg 8160 
ctcgcctggc gggtttggct ggtggtcttc 8220 
ctgctcgagg ggagttacgg tggatcggac 8280 
gtccgcgcgc ggcggtcgga gcttgatgac 8340 
ctggagctcc cgcggcgtca ggtcaggcgg 8400 
ggtcagggcg cgggctagat ccaggtgata 8460 
gtcgatggct tgcaagaggc cgcatccccg 8520 
gtgggccgcg ggggtgtcct tggatgatgc 8580 
ggaggtaggg ggggctccgg acccgccggg 8640 
gcgggcagga gctggtgctg cgcgcgtagg 8700 
atctcctgaa tctggcgcct ctgcgtgaag 8760 
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acgacgggcc cggtgagctt gagcctgaaa 
ttgacggcgg cctggcgcaa aatctcctgc 
tcggccatga actgctcgat ctcttcctcc 
gtggcggcga ggtcgttgga aatgcgggcc 
tcgttccaga cgcggctgta gaccacgccc 
tgcgcgagat tgagctccac gtgccgggcg 
aggtagttga gggtggtggc ggtgtgttct 
aacgtggatt cgttgatatc ccccaaggcc 
acggcgaagt tgaaaaactg ggagttgcgc 
cggatgagct cggcgacagt gtcgcgcacc 
tcttcttcaa tctcctcttc cataagggcc 
ggagggggga cacggcggcg acgacggcgc 
atctccccgc ggcgacggcg catggtctcg 
agttggaaga cgccgcccgt catgtcccgg 
agggatacgg cgctaacgat gcatctcaac 
gacctgagcg agtccgcatc gaccggatcg 
tcacagtcgc aaggtaggct gagcaccgtg 
tttctggcgg aggtgctgct gatgatgtaa 
gtcgacagaa gcaccatgtc cttgggtccg 
ccccaggctt cgttttgaca tcggcgcagg 
accggcactt cttcttctcc ttcctcttgt 
gcggcggagt ttggccgtag gtggcgccct 
ctcatcggct gaagcagggc taggtcggcg 
acctgcgtga gggtagactg gaagtcatcc 
ttgatggtgt aagtgcagtt ggccataacg 
gagagctcgg tgtacctgag acgcgagtaa 
gtccgcacca ggtactggta tcccaccaaa 
cagcgtaggg tggccggggc tccgggggcg 
tagatgtacc tggacatcca ggtgatgccg 
cggacgcggt tccagatgtt gcgcagcggc 
ccggtcaggc gcgcgcaatc gttgacgctc 
ggcactcttc cgtggtctgg tggataaatt 
tcgagccccg tatccggccg tccgccgtga 
caggtgtgcg acgtcagaca acgggggagt 
gctgctgcgc tagctttttt ggccactggc 
gcgaaagcat taagtggctc gctccctgta 
gcgggacccc cggttcgagt ctcggaccgg 
ccgtcatgca agaccccgct tgcaaattcc 
ttttcccaga tgcatccggt gctgcggcag 
caagagcagc ggcagacatg cagggcaccc 
acatccgcgg ttgacgcggc agcagatggt 
cactacctgg acttggagga gggcgagggc 
cggtacccaa gggtgcagct gaagcgtgat 
ctgtttcgcg accgcgaggg agaggagccc 
gggcgcgagc tgcggcatgg cctgaatcgc 
cccgacgcgc gaaccgggat tagtcccgcg 
accgcatacg agcagacggt gaaccaggag 
gtgcgtacgc ttgtggcgcg cgaggaggtg 
gtaagcgcgc tggagcaaaa cccaaatagc 
gtgcagcaca gcagggacaa cgaggcattc 
gagggccgct ggctgctcga tttgataaac 
agcttgagcc tggctgacaa ggtggccgcc 
ttttacgccc gcaagatata ccatacccct 
gaggggttct acatgcgcat ggcgctgaag 
tatcgcaacg agcgcatcca caaggccgtg 
cgcgagctga tgcacagcct gcaaagggcc 
gccgagtcct actttgacgc gggcgctgac 
gaggcagctg gggccggacc tgggctggcg 
ggcgtggagg aatatgacga ggacgatgag 
gtgatgtttc tgatcagatg atgcaagacg 
agagccagcc gtccggcctt aactccacgg 
tgtcgctgac tgcgcgcaat cctgacgcgt 
ccgcaattct ggaagcggtg gtcccggcgc 
cgatcgtaaa cgcgctggcc gaaaacaggg 
acgacgcgct gcttcagcgc gtggctcgtt 
accggctggt gggggatgtg cgcgaggccg 



gagagttcga cagaatcaat ttcggtgtcg 8820 
acgtctcctg agttgtcttg ataggcgatc 8880 
tggagatctc cgcgtccggc tcgctccacg 8940 
atgagctgcg agaaggcgtt gaggcctccc 9000 
ccttcggcat cgcgggcgcg catgaccacc 9060 
aagacggcgt agtttcgcag gcgctgaaag 9120 
gccacgaaga agtacataac ccagcgtcgc 9180 
tcaaggcgct ccatggcctc gtagaagtcc 9240 
gccgacacgg ttaactcctc ctccagaaga 9300 
tcgcgctcaa aggctacagg ggcctcttct 9360 
tccccttctt cttcttctgg cggcggtggg 9420 
accgggaggc ggtcgacaaa gcgctcgatc 9480 
gtgacggcgc ggccgttctc gcgggggcgc 9540 
ttatgggttg gcggggggct gccatgcggc 9600 
aattgttgtg taggtactcc gccgccgagg 9660 
gaaaacctct. cgagaaaggc gtctaaccag 9720 
gcgggcggca gcgggcggcg gtcggggttg 9780 
ttaaagtagg cggtcttgag acggcggatg 9840 
gcctgctgaa tgcgcaggcg gtcggccatg 9900 
tctttgtagt agtcttgcat gagcctttct 9960 
cctgcatctc ttgcatctat cgctgcggcg 10020 
cttcctccca tgcgtgtgac cccgaagccc 10080 
acaacgcgct cggctaatat ggcctgctgc 10140 
atgtccacaa agcggtggta tgcgcccgtg 10200 
gaccagttaa cggtctggtg acccggctgc 10260 
gccctcgagt caaatacgta gtcgttgcaa 10320 
aagtgcggcg gcggctggcg gtagaggggc 10380 
agatcttcca acataaggcg atgatatccg 10440 
gcggcggtgg tggaggcgcg cggaaagtcg 10500 
aaaaagtgct ccatggtcgg gacgctctgg 10560 
tagaccgtgc aaaaggagag cctgtaagcg 10620 
cgcaagggta tcatggcgga cgaccggggt 10680 
tccatgcggt taccgcccgc gtgtcgaacc 10740 
gctccttttg gcttccttcc aggcgcggcg 10800 
cgcgcgcagc gtaagcggtt aggctggaaa 10860 
gccggagggt tattttccaa gggttgagtc 10920 
ccggactgcg gcgaacgggg gtttgcctcc 10980 
tccggaaaca gggacgagcc ccttttttgc 11040 
atgcgccccc ctcctcagca gcggcaagag 11100 
tcccctcctc ctaccgcgtc aggaggggcg 11160 
gattacgaac ccccgcggcg ccgggcccgg 11220 
ctggcgcggc taggagcgcc ctctcctgag 11280 
acgcgtgagg cgtacgtgcc gcggcagaac 11340 
gaggagatgc gggatcgaaa gttccacgca 11400 
gagcggttgc tgcgcgagga ggactttgag 114 60 
cgcgcacacg tggcggccgc cgacctggta 11520 
attaactttc aaaaaagctt taacaaccac 11580 
gctataggac tgatgcatct gtgggacttt 11640 
aagccgctca tggcgcagct gttccttata 11700 
agggatgcgc tgctaaacat agtagagccc 11760 
atcctgcaga gcatagtggt gcaggagcgc 11820 
atcaactatt ccatgcttag cctgggcaag 11880 
tacgttccca tagacaagga ggtaaagatc 11940 
gtgcttacct tgagcgacga cctgggcgtt 12000 
agcgtgagcc ggcggcgcga gctcagcgac 12060 
ctggctggca cgggcagcgg cgatagagag 12120 
ctgcgctggg ccccaagccg acgcgccctg 12180 
gtggcacccg cgcgcgctgg caacgtcggc 12240 
tacgagccag aggacggcga gtactaagcg 12300 
caacggaccc ggcggtgcgg gcggcgctgc 12360 
acgactggcg ccaggtcatg gaccgcatca 12420 
tccggcagca gccgcaggcc aaccggctct 12480 
gcgcaaaccc cacgcacgag aaggtgctgg 12540 
ccatccggcc cgacgaggcc ggcctggtct 12600 
acaacagcgg caacgtgcag accaacctgg 12660 
tggcgcagcg tgagcgcgcg cagcagcagg 12720 
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gcaacctggg ctccatggtt gcactaaacg ccttcctgag tacacagccc gccaacgtgc 12780 
cgcggggaca ggaggactac accaactttg tgagcgcact gcggctaatg gtgactgaga 12840 
caccgcaaag tgaggtgtac cagtctgggc cagactattt tttccagacc agtagacaag 12900 
gcctgcagac cgtaaacctg agccaggctt tcaaaaactt gcaggggctg tggggggtgc 12960 
gggctcccac aggcgaccgc gcgaccgtgt ctagcttgct gacgcccaac tcgcgcctgt 13020 
tgctgctgct aatagcgccc ttcacggaca gtggcagcgt gtcccgggac acatacctag 13080 
gtcacttgct gacactgtac cgcgaggcca taggtcaggc gcatgtggac gagcatactt 13140 
tccaggagat tacaagtgtc agccgcgcgc tggggcagga ggacacgggc agcctggagg. 13200 
caaccctaaa ctacctgctg accaaccggc ggcagaagat cccctcgttg cacagtttaa 13260 
acagcgagga ggagcgcatt ttgcgctacg tgcagcagag cgtgagcctt aacctgatgc 13320 
gcgacggggt aacgcccagc gtggcgctgg acatgaccgc gcgcaacatg gaaccgggca 13380 
tgtatgcctcaaaccggccg tttatcaacc gcctaatgga ctacttgcat cgcgcggccg 13440 
ccgtgaaccc cgagtatttc accaatgcca tcttgaaccc gcactggcta ccgccccctg 13500 
gttt.ctacac cgggggattc gaggtgcccg agggtaacga tggattcctc tgggacgaca 13560 
tagacgacag cgtgttttcc ccgcaaccgc agaccctgct agagttgcaa cagcgcgagc 13620 
aggcagaggc ggcgctgcga aaggaaagct tccgcaggcc aagcagcttg tccgatctag 13680 
gcgctgcggc cccgcggtca gatgctagta gcccatttcc aagcttgata gggtctctta 13740 
ccagcactcg caccacccgc ccgcgcctgc tgggcgagga ggagtaccta aacaactcgc 13800 
tgctgcagcc gcagcgcgaa aaaaacctgc ctccggcatt tcccaacaac gggatagaga 13860 
gcctagtgga caagatgagt agatggaaga cgtacgcgca* ggagcacagg gacgtgccag 13920 
gcccgcgccc gcccacccgt cgtcaaaggc acgaccgtca gcggggtctg gtgtgggagg 13980 
acgatgactc ggcagacgac agcagcgtcc tggatttggg agggagtggc aacccgtttg 14040 
cgcaccttcg ccccaggctg gggagaatgt tttaaaaaaa aaaaagcatg atgcaaaata 14100 
aaaaactcac caaggccatg gcaccgagcg ttggttttct tgtattcccc ttagtatgcg 14160 
gcgcgcggcg atgtatgagg aaggtcctcc tccctcctac gagagtgtgg tgagcgcggc 14220 
gccagtggcg gcggcgctgg gttctccctt cgatgctccc ctggacccgc cgtttgtgcc 14280 
tccgcggtac ctgcggccta ccggggggag aaacagcatc cgttactctg agttggcacc 14340 
cctattcgac accacccgtg tgtacctggt ggacaacaag tcaacggatg tggcatccct 14400 
gaactaccag aacgaccaca gcaactttct gaccacggtc attcaaaaca atgactacag 144 60 
cccgggggag gcaagcacac agaccatcaa tcttgacgac cggtcgcact ggggcggcga 14520 
cctgaaaacc atcctgcata ccaacatgcc aaatgtgaac gagttcatgt ttaccaataa 14580 
gtttaaggcg cgggtgatgg tgtcgcgctt gcctactaag gacaatcagg tggagctgaa 14640 
atacgagtgg gtggagttca cgctgcccga gggcaactac tccgagacca tgaccataga 14700 
ccttatgaac aacgcgatcg tggagcacta cttgaaagtg ggcagacaga acggggttct 14760 
ggaaagcgac atcggggtaa agtttgacac ccgcaacttc agactggggt ttgaccccgt 14820 
cactggtctt gtcatgcctg gggtatatac aaacgaagcc ttccatccag acatcatttt 14880 
gctgccagga tgcggggtgg acttcaccca cagccgcctg agcaacttgt tgggcatccg 14940 
caagcggcaa cccttccagg agggctttag gatcacctac gatgatctgg agggtggtaa 15000 
cattcccgca ctgttggatg tggacgccta ccaggcgagc ttgaaagatg acaccgaaca 15060 
gggcgggggt ggcgcaggcg gcagcaacag cagtggcagc ggcgcggaag agaactccaa 15120 
cgcggcagcc gcggcaatgc agccggtgga ggacatgaac gatcatgcca ttcgcggcga 15180 
cacctttgcc acacgggctg aggagaagcg cgctgaggcc gaagcagcgg ccgaagctgc 15240 
cgcccccgct gcgcaacccg aggtcgagaa gcctcagaag aaaccggtga tcaaacccct 15300 
gacagaggac agcaagaaac gcagttacaa cctaataagc aatgacagca ccttcaccca 15360 
gtaccgcagc tggtaccttg catacaacta cggcgaccct cagaccggaa tccgctcatg 15420 
gaccctgctt tgcactcctg acgtaacctg cggctcggag caggtctact ggtcgttgcc 15480 
agacatgatg caagaccccg tgaccttccg ctccacgcgc cagatcagca actttccggt 15540 
ggtgggcgcc gagctgttgc ccgtgcactc caagagcttc tacaacgacc aggccgtcta 15600 
ctcccaactc atccgccagt ttacctctct gacccacgtg ttcaatcgct ttcccgagaa 15660 
ccagattttg gcgcgcccgc cagcccccac catcaccacc gtcagtgaaa acgttcctgc 15720 
tctcacagat cacgggacgc taccgctgcg caacagcatc ggaggagtcc agcgagtgac 15780 
cattactgac gccagacgcc gcacctgccc ctacgtttac aaggccctgg gcatagtctc 15840 
gccgcgcgtc ctatcgagcc gcactttttg agcaagcatg tccatcctta tatcgcccag 15900 
caataacaca ggctggggcc tgcgcttccc aagcaagatg tttggcgggg ccaagaagcg 15960 
ctccgaccaa cacccagtgc gcgtgcgcgg gcactaccgc gcgccctggg gcgcgcacaa 16020 
acgcggccgc actgggcgca ccaccgtcga tgacgccatc gacgcggtgg tggaggaggc 16080 
gcgcaactac acgcccacgc cgccaccagt gtccacagtg gacgcggcca ttcagaccgt 16140 
ggtgcgcgga gcccggcgct atgctaaaat gaagagacgg cggaggcgcg tagcacgtcg 16200 
ccaccgccgc cgacccggca ctgccgccca acgcgcggcg gcggccctgc ttaaccgcgc 16260 
acgtcgcacc ggccgacggg cggccatgcg ggccgctcga aggctggccg cgggtattgt 16320 
cactgtgccc cccaggtcca ggcgacgagc ggccgccgca gcagccgcgg ccattagtgc 16380 
tatgactcag ggtcgcaggg gcaacgtgta ttgggtgcgc gactcggtta gcggcctgcg 1644 0 
cgtgcccgtg cgcacccgcc ccccgcgcaa ctagattgca agaaaaaact acttagactc 16500 
gtactgttgt atgtatccag cggcggcggc gcgcaacgaa gctatgtcca agcgcaaaat 16560 
caaagaagag atgctccagg tcatcgcgcc ggagatctat ggccccccga agaaggaaga 16620 
gcaggattac aagccccgaa agctaaagcg ggtcaaaaag aaaaagaaag atgatgatga 16680 
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tgaacttgac gacgaggtgg aactgctgca cgctaccgcg cccaggcgac gggtacagtg 16740 
gaaaggtcga cgcgtaaaac gtgttttgcg acccggcacc accgtagtct ttacgcccgg 16800 
tgagcgctcc acccgcacct acaagcgcgt gtatgatgag gtgtacggcg acgaggacct 16860 
gcttgagcag gccaacgagc gcctcgggga gtttgcctac ggaaagcggc ataaggacat 16920 
gctggcgttg ccgctggacg agggcaaccc aacacctagc ctaaagcccg taacactgca 16980 
gcaggtgctg cccgcgcttg caccgtccga agaaaagcgc ggcctaaagc gcgagtctgg .17040 
tgacttggca cccaccgtgc agctgatggt acccaagcgc cagcgactgg aagatgtctt 17100 
ggaaaaaatg accgtggaac ctgggctgga gcccgaggtc cgcgtgcggc caatcaagca 17160 
ggtggcgccg ggactgggcg tgcagaccgt ggacgttcag atacccacta ccagtagcac 17220 
cagtattgcc accgccacag agggcatgga gacacaaacg tccccggttg cctcagcggt 17280 
ggcggatgcc gcggtgcagg cggtcgctgc ggccgcgtcc aagacctcta cggaggtgca 17340 
aacggacccg tggatgtttc gcgtttcagc cccccggcgc ccgcgcggtt cgaggaagta 17400 
cggcgccgcc agcgcgctac tgcccgaata tgccctacat ccttccattg cgcctacccc 174 60 
cggctatcgt ggctacacct accgccccag aagacgagca actacccgac gccgaaccac 17520 
cactggaacc cgccgccgcc gtcgccgtcg ccagcccgtg ctggccccga tttccgtgcg 17580 
cagggtggct cgcgaaggag gcaggaccct ggtgctgcca acagcgcgct accaccccag 17640 
catcgtttaa aagccggtct ttgtggttct tgcagatatg gccctcacct gccgcctccg 17700 
tttcccggtg ccgggattcc gaggaagaat gcaccgtagg aggggcatgg ccggccacgg 17760 
cctgacgggc ggcatgcgtc gtgcgcacca ccggcggcgg cgcgcgtcgc accgtcgcat 17820 
gcgcggcggt atcctgcccc tccttattcc actgatcgcc gcggcgattg gcgccgtgcc 17880 
cggaattgca tccgtggcct tgcaggcgca gagacactga ttaaaaacaa gttgcatgtg 17940 
gaaaaatcaa aataaaaagt ctggactctc acgctcgctt ggtcctgtaa ctattttgta 18000 
gaatggaaga catcaacttt gcgtctctgg ccccgcgaca cggctcgcgc ccgttcatgg 18060 
gaaactggca agatatcggc accagcaata tgagcggtgg cgccttcagc tggggctcgc 18120 
tgtggagcgg cattaaaaat ttcggttcca ccgttaagaa ctatggcagc aaggcctgga 18180 
acagcagcac aggccagatg ctgagggata agttgaaaga gcaaaatttc caacaaaagg 18240 
tggtagatgg cctggcctct ggcattagcg gggtggtgga cctggccaac caggcagtgc 18300 
aaaataagat taacagtaag cttgatcccc gccctcccgt agaggagcct ccaccggccg 18360 
tggagacagt gtctccagag gggcgtggcg aaaagcgtcc gcgccccgac agggaagaaa 18420 
ctctggtgac gcaaatagac gagcctccct cgtacgagga ggcactaaag caaggcctgc 184 80 
ccaccacccg tcccatcgcg cccatggcta ccggagtgct gggccagcac acacccgtaa 18540 
cgctggacct gcctcccccc gccgacaccc agcagaaacc tgtgctgcca m ggcccgaccg 18600 
ccgttgttgt aacccgtcct agccgcgcgt ccctgcgccg cgccgccagc ggtccgcgat 18660 
cgttgcggcc cgtagccagt ggcaactggc aaagcacact gaacagcatc gtgggtctgg 18720 
gggtgcaatc cctgaagcgc cgacgatgct tctgaatagc taacgtgtcg tatgtgtgtc 18780 
atgtatgcgt ccatgtcgcc gccagaggag ctgctgagcc gccgcgcgcc cgctttccaa 18840 
gatggctacc ccttcgatga tgccgcagtg gtcttacatg cacatctcgg gccaggacgc 18900 
ctcggagtac ctgagccccg ggctggtgca gtttgcccgc gccaccgaga cgtacttcag 18960 
cctgaataac aagtttagaa accccacggt ggcgcctacg cacgacgtga ccacagaccg 19020 
gtcccagcgt ttgacgctgc ggttcatccc tgtggaccgt gaggatactg cgtactcgta 19080 
caaggcgcgg ttcaccctag ctgtgggtga taaccgtgtg ctggacatgg cttccacgta 19140 
ctttgacatc cgcggcgtgc tggacagggg ccctactttt aagccctact ctggcactgc 19200 
ctacaacgcc ctggctccca agggtgcccc aaatccttgc gaatgggatg aagctgctac 19260 
tgctcttgaa ataaacctag aagaagagga cgatgacaac gaagacgaag tagacgagca 19320 
agctgagcag caaaaaactc acgtatttgg gcaggcgcct tattctggta taaatattac 19380 
aaaggagggt attcaaatag gtgtcgaagg tcaaacacct aaatatgccg ataaaacatt 19440 
tcaacctgaa cctcaaatag gagaatctca gtggtacgaa actgaaatta atcatgcagc 19500 
tgggagagtc cttaaaaaga ctaccccaat gaaaccatgt tacggttcat atgcaaaacc 19560 
cacaaatgaa aatggagggc aaggcattct tgtaaagcaa caaaatggaa agctagaaag 19620 
tcaagtggaa atgcaatttt tctcaactac tgaggcgacc gcaggcaatg gtgataactt 19680 
gactcctaaa gtggtattgt acagtgaaga tgtagatata gaaaccccag acactcatat 19740 
ttcttacatg cccactatta aggaaggtaa ctcacgagaa ctaatgggcc aacaatctat 19800 
gcccaacagg cctaattaca ttgcttttag ggacaatttt attggtctaa tgtattacaa 19860 
cagcacgggt aatatgggtg ttctggcggg ccaagcatcg cagttgaatg ctgttgtaga 19920 
tttgcaagac agaaacacag agctttcata ccagcttttg cttgattcca ttggtgatag 19980 
aaccaggtac ttttctatgt ggaatcaggc tgttgacagc tatgatccag atgttagaat 20040 
tattgaaaat catggaactg aagatgaact tccaaattac tgctttccac tgggaggtgt 20100 
gattaataca gagactctta ccaaggtaaa acctaaaaca ggtcaggaaa atggatggga 20160 
aaaagatgct acagaatttt cagataaaaa tgaaataaga gttggaaata attttgccat 20220 
ggaaatcaat ctaaatgcca acctgtggag aaatttcctg tactccaaca tagcgctgta 20280 
tttgcccgac aagctaaagt acagtccttc caacgtaaaa atttctgata acccaaacac 20340 
ctacgactac atgaacaagc gagtggtggc tcccgggtta gtggactgct acattaacct 20400 
tggagcacgc tggtcccttg actatatgga caacgtcaac ccatttaacc accaccgcaa 20460 
tgctggcctg cgctaccgct caatgttgct gggcaatggt cgctatgtgc ccttccacat 20520 
ccaggtgcct cagaagttct ttgccattaa aaacctcctt ctcctgccgg gctcatacac 20580 
ctacgagtgg aacttcagga aggatgttaa catggttctg cagagctccc taggaaatga 20640 



WO 01/04282 



24 



PCT/USOO/18971 



cctaagggtt gacggagcca gcattaagtt tgatagcatt tgcctttacg ccaccttctt 20700 
ccccatggcc cacaacaccg cctccacgct tgaggccatg cttagaaacg acaccaacga 20760 
ccagtccttt aacgactatc tctccgccgc caacatgctc taccctatac ccgccaacgc 20820 
taccaacgtg cccatatcca tcccctcccg caactgggcg gctttccgcg gctgggcctt 20880 
cacgcgcctt aagactaagg aaaccccatc actgggctcg ggctacgacc cttattacac 20940 
ctactctggc tctataccct acctagatgg aaccttttac ctcaaccaca cctttaagaa 21000 
ggtggccatt acctttgact cttctgtcag ctggcctggc aatgaccgcc tgcttacccc 21060 
caacgagttt gaaattaagc gctcagttga cggggagggt tacaacgttg cccagtgtaa 21120 
catgaccaaa gactggttcc tggtacaaat gctagctaac tacaacattg gctaccaggg 21180 
cttctatatc ccagagagct acaaggaccg catgtactcc ttctttagaa acttccagcc 21240 
catgagccgt caggtggtgg atgatactaa atacaaggac taccaacagg tgggcatcct 21300 
acaccaacac aacaactctg gatttgttgg ctaccttgcc cccaccatgc gcgaaggaca 21360 
ggcctaccct gctaacttcc cctatccgct tataggcaag accgcagttg acagcattac 21420 
ccagaaaaag tttctttgcg atcgcaccct ttggcgcatc ccattctcca gtaactttat 21480 
gtccatgggc gcactcacag acctgggcca aaaccttctc tacgccaact ccgcccacgc 21540 
gctagacatg acttttgagg tggatcccat ggacgagccc acccttcttt atgttttgtt 21600 
tgaagtcttt gacgtggtcc gtgtgcaccg gccgcaccgc ggcgtcatcg aaaccgtgta 21660 
cctgcgcacg cccttctcgg ccggcaacgc cacaacataa agaagcaagc aacatcaaca 21720 
acagctgccg ccatgggctc cagtgagcag gaactgaaag ccattgtcaa agatcttggt 21780 
tgtgggccat attttttggg cacctatgac aagcgctttc caggctttgt ttctccacac 21840 
aagctcgcct gcgccatagt caatacggcc ggtcgcgaga ctgggggcgt acactggatg 21900 
gcctttgcct ggaacccgca ctcaaaaaca tgctacctct ttgagccctt tggcttttct 21960 
gaccagcgac tcaagcaggt ttaccagttt gagtacgagt cactcctgcg ccgtagcgcc 22020 
attgcttctt cccccgaccg ctgtataacg ctggaaaagt ccacccaaag cgtacagggg 22080 
cccaactcgg ccgcctgtgg actattctgc tgcatgtttc tccacgcctt tgccaactgg 22140 
ccccaaactc ccatggatca caaccccacc atgaacctta ttaccggggt acccaactcc 22200 
atgctcaaca gtccccaggt acagcccacc ctgcgtcgca accaggaaca gctctacagc 2?260" 
ttcctggagc gccactcgcc ctacttccgc agccacagtg cgcagattag gagcgccact 22320 
tctttttgtc acttgaaaaa catgtaaaaa taatgtacta gagacacttt caataaaggc 22380 
aaatgctttt atttgtacac tctcgggtga ttatttaccc ccacccttgc cgtctgcgcc 22440 
gtttaaaaat caaaggggtt ctgccgcgca tcgctatgcg ccactggcag ggacacgttg 22500 
cgatactggt gtttagtgct ccacttaaac tcaggcacaa ccatccgcgg cagctcggtg 22560 
aagttttcac tccacaggct gcgcaccatc accaacgcgt ttagcaggtc gggcgccgat 22620 
atcttgaagt cgcagttggg gcctccgccc tgcgcgcgcg agttgcgata cacagggttg 22680 
cagcactgga acactatcag cgccgggtgg tgcacgctgg ccagcacgct cttgtcggag 22740 
atcagatccg cgtccaggtc ctccgcgttg ctcagggcga acggagtcaa ctttggtagc 22800 
tgccttccca aaaagggcgc gtgcccaggc tttgagttgc actcgcaccg tagtggcatc 22860 
aaaaggtgac cgtgcccggt ctgggcgtta ggatacagcg cctgcataaa agccttgatc 22920 
tgcttaaaag ccacctgagc ctttgcgcct tcagagaaga acatgccgca agacttgccg 22980 
gaaaactgat tggccggaca ggccgcgtcg tgcacgcagc accttgcgtc ggtgttggag 23040 
atctgcacca catttcggcc ccaccggttc ttcacgatct tggccttgct agactgctcc 23100 
ttcagcgcgc gctgcccgtt ttcgctcgtc acatccattt caatcacgtg ctccttattt 23160 
atcataatgc ttccgtgtag acacttaagc tcgccttcga tctcagcgca gcggtgcagc 23220 
cacaacgcgc agcccgtggg ctcgtgatgc ttgtaggtca cctctgcaaa cgactgcagg 23280 
tacgcctgca ggaatcgccc catcatcgtc acaaaggtct tgttgctggt gaaggtcagc 23340 
tgcaacccgc ggtgctcctc gttcagccag gtcttgcata cggccgccag agcttccact 23400 
tggtcaggca gtagtttgaa gttcgccttt agatcgttat ccacgtggta cttgtccatc 234 60 
agcgcgcgcg cagcctccat gcccttctcc cacgcagaca cgatcggcac actcagcggg 23520 
ttcatcaccg taatttcact ttccgcttcg ctgggctctt cctcttcctc ttgcgtccgc 23580 
ataccacgcg ccactgggtc gtcttcattc agccgccgca ctgtgcgctt acctcctttg 23640 
ccatgcttga ttagcaccgg tgggttgctg aaacccacca tttgtagcgc cacatcttct 23700 
ctttcttcct cgctgtccac gattacctct ggtgatggcg ggcgctcggg cttgggagaa 23760 
gggcgcttct ttttcttctt gggcgcaatg gccaaatccg ccgccgaggt cgatggccgc 23820 
gggctgggtg tgcgcggcac cagcgcgtct tgtgatgagt cttcctcgtc ctcggactcg 23880 
atacgccgcc tcatccgctt ttttgggggc gcccggggag gcggcggcga cggggacggg 23940 
gacgacacgt cctccatggt tgggggacgt cgcgccgcac cgcgtccgcg ctcgggggtg 24000 
gtttcgcgct gctcctcttc ccgactggcc atttccttct cctataggca gaaaaagatc 24060 
atggagtcag tcgagaagaa ggacagccta accgccccct ctgagttcgc caccaccgcc 24120 
tccaccgatg ccgccaacgc gcctaccacc ttccccgtcg aggcaccccc gcttgaggag 24180 
gaggaagtga ttatcgagca ggacccaggt tttgtaagcg aagacgacga ggaccgctca 24240 
gtaccaacag aggataaaaa gcaagaccag gacaacgcag aggcaaacga ggaacaagtc 24300 
gggcgggggg acgaaaggca tggcgactac ctagatgtgg gagacgacgt gctgttgaag 24360 
catctgcagc gccagtgcgc cattatctgc gacgcgttgc aagagcgcag cgatgtgccc 24420 
ctcgccatag cggatgtcag ccttgcctac gaacgccacc tattctcacc gcgcgtaccc 24480 
cccaaacgcc aagaaaacgg cacatgcgag cccaacccgc gcctcaactt ctaccccgta 24540 
tttgccgtgc cagaggtgct tgccacctat cacatctttt tccaaaactg caagataccc 24 600 
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ctatcctgcc gtgccaaccg cagccgagcg gacaagcagc tggccttgcg gcagggcgct 24 660 
gtcatacctg atatcgcctc gctcaacgaa gtgccaaaaa tctttgaggg tcttggacgc 24720 
gacgagaagc gcgcggcaaa cgctctgcaa caggaaaaca gcgaaaatga aagtcactct 24780 
ggagtgttgg tggaactcga gggtgacaac gcgcgcctag ccgtactaaa acgcagcatc 24 840 
gaggtcaccc actttgccta cccggcactt aacctacccc ccaaggtcat gagcacagtc 24 900 
atgagtgagc tgatcgtgcg ccgtgcgcag cccctggaga gggatgcaaa tttgcaagaa 24 960 
caaacagagg agggcctacc cgcagttggc gacgagcagc tagcgcgctg gcttcaaacg 25020 
cgcgagcctg ccgacttgga ggagcgacgc aaactaatga tggccgcagt gctcgttacc 25080 
gtggagcttg agtgcatgca gcggttcttt gctgacccgg agatgcagcg caagctagag 25140 
gaaacattgc actacacctt tcgacagggc tacgtacgcc aggcctgcaa gatctccaac 25200 
gtggagctct gcaacctggt ctcctacctt ggaattttgc acgaaaaccg ccttgggcaa 25260 
aacgtgcttc attccacgct caagggcgag gcgcgccgcg actacgtccg cgactgcgtt 25320 
tacttatttc tatgctacac ctggcagacg gccatgggcg tttggcagca gtgcttggag 25380 
gagtgcaacc tcaaggagct gcagaaactg ctaaagcaaa acttgaagga cctatggacg 254 40 
gccttcaacg agcgctccgt ggccgcgcac ctggcggaca tcattttccc cgaacgcctg 25500 
cttaaaaccc tgcaacaggg tctgccagac ttcaccagtc aaagcatgtt gcagaacttt 25560 
aggaacttta tcctagagcg ctcaggaatc ttgcccgcca cctgctgtgc acttcctagc 25620 
gactttgtgc ccattaagta ccgcgaatgc cctccgccgc tttggggcca ctgctacctt 25680 
ctgcagctag ccaactacct tgcctaccac tctgacataa tggaagacgt gagcggtgac 25740 
ggtctactgg agtgtcactg tcgctgcaac ctatgcaccc cgcaccgctc cctggtttgc 25800 
aattcgcagc tgcttaacga aagtcaaatt atcggtacct ttgagctgca gggtccctcg 25860 
cctgacgaaa agtccgcggc tccggggttg aaactcactc cggggctgtg gacgtcggct 25920 
taccttcgca aatttgtacc tgaggactac cacgcccacg agattaggtt ctacgaagac 25980 
caatcccgcc cgccaaatgc ggagcttacc gcctgcgtca ttacccaggg ccacattctt 26040 
ggccaattgc aagccatcaa caaagcccgc caagagtttc tgctacgaaa gggacggggg 26100 
gtttacttgg acccccagtc cggcgaggag ctcaacccaa tccccccgcc gccgcagccc 26160 
tatcagcagc agccgcgggc ccttgcttcc caggatggca cccaaaaaga agctgcagct 26220 
gccgccgcca cccacggacg aggaggaata ctgggacagt caggcagagg aggttttgga 26280 
cgaggaggag gaggacatga tggaagactg ggagagccta gacgaggaag cttccgaggt 26340 
cgaagaggtg tcagacgaaa caccgtcacc ctcggtcgca ttcccctcgc cggcgcccca 26400 
gaaatcggca accggttcca gcatggctac aacctccgct cctcaggcgc cgccggcact 264 60 
gcccgttcgc cgacccaacc gtagatggga caccactgga accagggccg gtaagtccaa 26520 
gcagccgccg ccgttagccc aagagcaaca acagcgccaa ggctaccgct catggcgcgg 26580 
gcacaagaac gccatagttg cttgcttgca agactgtggg ggcaacatct ccttcgcccg 26640 
ccgctttctt ctctaccatc acggcgtggc cttcccccgt aacatcctgc attactaccg 26700 
tcatctctac agcccatact gcaccggcgg cagcggcagc ggcagcaaca gcagcggcca 26760 
cacagaagca aaggcgaccg gatagcaaga ctctgacaaa gcccaagaaa tccacagcgg 26820 
cggcagcagc aggaggagga gcgctgcgtc tggcgcccaa cgaacccgta tcgacccgcg 26880 
agcttagaaa caggattttt cccactctgt atgctatatt tcaacagagc aggggccaag 26940 
aacaagagct gaaaataaaa aacaggtctc tgcgatccct cacccgcagc tgcctgtatc 27000 
acaaaagcga agatcagctt cggcgcacgc tggaagacgc ggaggctctc ttcagtaaat 27060 
actgcgcgct gactcttaag gactagtttc gcgccctttc tcaaatttaa gcgcgaaaac 27120 
tacgtcatct ccagcggcca cacccggcgc cagcacctgt cgtcagcgcc attatgagca 27180 
aggaaattcc cacgccctac atgtggagtt accagccaca aatgggactt gcggctggag 27240 
ctgcccaaga ctactcaacc cgaataaact acatgagcgc gggaccccac atgatatccc 27300 
gggtcaacgg aatccgcgcc caccgaaacc gaattctctt ggaacaggcg" gctattacca 27360 
ccacacctcg taataacctt aatccccgta gttggcccgc tgccctggtg taccaggaaa 27420 
gtcccgctcc caccactgtg gtacttccca gagacgccca ggccgaagtt cagatgacta 27480 
actcaggggc gcagcttgcg ggcggctttc gtcacagggt gcggtxgccc gggcagggta 27540 
taactcacct gacaatcaga gggcgaggta ttcagctcaa cgacgagtcg gtgagctcct 27600 
cgcttggtct ccgtccggac gggacatttc agatcggcgg cgccggccgt ccttcattca 27660 
cgcctcgtca ggcaatccta actctgcaga cctcgtcctc tgagccgcgc tctggaggca 27720 
ttggaactct gcaatttatt gaggagtttg tgccatcggt ctactttaac cccttctcgg 27780 
gacctcccgg ccactatccg gatcaattta ttcctaactt tgacgcggta aaggactcgg 27840 
cggacggcta cgactgataa ttaagtggag aggcagagca actgcgcctg aaacacctgg 27900 
tccactgtcg ccgccacaag tgctttgccc gcgactccgg tgagttttgc tactttgaat 27960 
tgcccgagga tcatatcgag gatctttgtt gccatctctg tgctgagtat aataaataca 28020 
gaaattaaaa tatactgggg ctcctatcgc catcctgtaa acgccaccgt cttcacccgc 28080 
ccaagcaaac caaggcgaac cttacctggt acttttaaca tctctccctc tgtgatttac 28140 
aacagtttca acccagacgg agtgagtcta cgagagaacc tctccgagct cagctactcc 28200 
atcagaaaaa acaccaccct ccttacctgc cgggaacgta cccttaatta aaagtcaggc 28260 
ttcctggatg tcagcatctg actttggcca gcacctgtcc cgcggatttg ttccagtcca 28320 
actacagcga cccaccctaa cagagatgac caacacaacc aacgcggccg ccgctaccgg 28380 
acttacatct accacaaata caccccaagt ttctgccttt gtcaataact gggataactt 28440 
gggcatgtgg tggttctcca tagcgcttat gtttgtatgc cttattatta tgtggctcat 28500 
ctgctgccta aagcgcaaac gcgcccgacc acccatctat agtcccatca ttgtgctaca 28560 
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cccaaacaat gatggaatcc atagattgga cggactgaaa cacatgttct tttctcttac 28620 
agtatgatta aatgagatta attaaggaat ttctgtccag tttattcagc agcacctcct 28680 
tgccctcctc ccagctctgg tattgcagct tcctcctggc tgcaaacttt ctccacaatc 28740 
taaatggaat gtcagtttcc tcctgttcct gtccatccgc acccactatc ttcatgttgt 28800 
tgcagatgaa gcgcgcaaga ccgtctgaag ataccttcaa ccccgtgtat ccatatgaca 28860 
cggaaaccgg tcctccaact gtgccttttc ttactcctcc ctttgtatcc cccaatgggt 28920 
ttcaagagag tccccctggg gtactctctt tgcgcctatc cgaacctcta gttacctcca 28980 
atggcatgct tgcgctcaaa atgggcaacg gcctctctct ggacgaggcc ggcaacctta 29040 
cctcccaaaa tgtaaccact gtgagcccac ctctcaaaaa aaccaagtca aacataaacc 29100 
tggaaatatc tgcacccctc acagttacct cagaagccct aactgtggct gccgccgcac 29160 
ctctaatggt cgcgggcaac acactcacca tgcaatcaca ggccccgcta accgtgcacg 29220 
actccaaact tagcattgcc acccaaggac ccctcacagt gtcagaagga aagctagccc 29280 
tgcaaacatc aggccccctc accaccaccg atagcagtac ccttactatc actgcctcac 29340 
cccctctaac tactgccact ggtagcttgg gcattgactt gaaagagccc atttatacac 29400 
aaaatggaaa actaggacta aagtacgggg ctcctttgca tgtaacagac gacctaaaca 294 60 
ctttgaccgt agcaactggt ccaggtgtga ctattaataa tacttccttg caaactaaag 29520 
ttactggagc cttgggtttt gattcacaag gcaatatgca acttaatgta gcaggaggac 29580 
taaggattga ttctcaaaac agacgcctta tacttgatgt tagttatccg tttgatgctc 29640 
aaaaccaact aaatctaaga ctaggacagg gccctctttt tataaactca gcccacaact 29700 
tggatattaa ctacaacaaa ggcctttact tgtttacagc ttcaaacaat tccaaaaagc 29760 
ttgaggttaa cctaagcact gccaaggggt tgatgtttga cgctacagcc atagccatta 29820 
atgcaggaga tgggcttgaa tttggttcac ctaatgcacc aaacacaaat cccctcaaaa 29880 
caaaaattgg ccatggccta gaatttgatt caaacaaggc tatggttcct aaactaggaa 29940 
ctggccttag ttttgacagc acaggtgcca ttacagtagg aaacaaaaat aatgataagc 30000 
taactttgtg gaccacacca gctccatctc ctaactgtag actaaatgca gagaaagatg 30060 
ctaaactcac tttggtctta acaaaatgtg gcagtcaaat acttgctaca gtttcagttt 30120 
tggctgttaa aggcagtttg gctccaatat ctggaacagt tcaaagtgct catcttatta 30180 
taagatttga cgaaaatgga gtgctactaa acaattcctt cctggaccca gaatattgga 30240 
actttagaaa tggagatctt actgaaggca cagcctatac aaacgctgtt ggatttatgc 30300 
ctaacctatc agcttatcca aaatctcacg gtaaaactgc caaaagtaac attgtcagtc 30360 
aagtttactt aaacggagac aaaactaaac ctgtaacact aaccattaca ctaaacggta 30420 
cacaggaaac aggagacaca actccaagtg catactctat gtcattttca tgggactggt 30480 
ctggccacaa ctacattaat gaaatatttg ccacatcctc ttacactttt tcatacattg 30540 
cccaagaata aagaatcgtt tgtgttatgt ttcaacgtgt ttatttttca attgcagaaa 30600 
atttcaagtc atttttcatt cagtagtata gccccaccac cacatagctt atacagatca 30660 
ccgtacctta atcaaactca cagaacccta gtattcaacc tgccacctcc ctcccaacac 30720 
acagagtaca cagtcctttc tccccggctg gccttaaaaa gcatcatatc atgggtaaca 30780 
gacatattct taggtgttat attccacacg gtttcctgtc gagccaaacg ctcatcagtg 30840 
atattaataa actccccggg cagctcactt aagttcatgt cgctgtccag ctgctgagcc 30900 
acaggctgct gtccaacttg cggttgctta acgggcggcg aaggagaagt ccacgcctac 30960 
atgggggtag agtcataatc gtgcatcagg atagggcggt ggtgctgcag cagcgcgcga 31020 
ataaactgct gccgccgccg ctccgtcctg caggaataca acatggcagt ggtctcctca 31080 
gcgatgattc gcaccgcccg cagcataagg cgccttgtcc tccgggcaca gcagcgcacc 31140 
ctgatctcac ttaaatcagc acagtaactg cagcacagca ccacaatatt gttcaaaatc 31200 
ccacagtgca aggcgctgta tccaaagctc atggcgggga ccacagaacc cacgtggcca 31260 
tcataccaca agcgcaggta gattaagtgg cgacccctca taaacacgct ggacataaac 31320 
attacctctt ttggcatgtt gtaattcacc acctcccggt accatataaa cctctgatta 31380 
aacatggcgc catccaccac catcctaaac cagctggcca aaacctgccc gccggctata 31440 
cactgcaggg aaccgggact ggaacaatga cagtggagag cccaggactc gtaaccatgg 31500 
atcatcatgc tcgtcatgat atcaatgttg gcacaacaca ggcacacgtg catacacttc 31560 
ctcaggatta caagctcctc ccgcgttaga accatatccc agggaacaac ccattcctga 31620 
atcagcgtaa atcccacact gcagggaaga cctcgcacgt aactcacgtt gtgcattgtc 31680 
aaagtgttac attcgggcag cagcggatga tcctccagta tggtagcgcg ggtttctgtc 31740 
tcaaaaggag gtagacgatc cctactgtac ggagtgcgcc gagacaaccg agatcgtgtt 31800 
ggtcgtagtg tcatgccaaa tggaacgccg gacgtagtca tatttcctga agcaaaacca 31860 
ggtgcgggcg tgacaaacag atctgcgtct ccggtctcgc cgcttagatc gctctgtgta 31920 
gtagttgtag tatatccact ctctcaaagc atccaggcgc cccctggctt cgggttctat 31980 
gtaaactcct tcatgcgccg ctgccctgat aacatccacc accgcagaat aagccacacc 32040 
cagccaacct acacattcgt tctgcgagtc acacacggga ggagcgggaa gagctggaag 32100 
aaccatgttt ttttttttat tccaaaagat tatccaaaac ctcaaaatga agatctatta 32160 
agtgaacgcg ctcccctccg gtggcgtggt caaactctac agccaaagaa cagataatgg 32220 
catttgtaag atgttgcaca atggcttcca aaaggcaaac ggccctcacg tccaagtgga 32280 
cgtaaaggct aaacccttca gggtgaatct cctctataaa cattccagca ccttcaacca 32340 
tgcccaaata attctcatct cgccaccttc tcaatatatc tctaagcaaa tcccgaatat 32400 
taagtccggc cattgtaaaa atctgctcca gagcgccctc caccttcagc ctcaagcagc 324 60 
gaatcatgat tgcaaaaatt caggttcctc acagacctgt ataagattca aaagcggaac 32520 
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attaacaaaa ataccgcgat cccgtaggtc 

caggtctgca cggaccagcg cggccacttc 

actgattatg acacgcatac tcggagctat 

gttgcatggg cggcgatata aaatgcaagg 

gcaaaaaaga aagcacatcg tagtcatgct 

ccaccacaga aaaagacacc atttttctct 

caaaataaaa taacaaaaaa acatttaaac 

aacccttata agcataagac ggactacggc 

accgtgatta aaaagcacca ccgacagctc 

ctcggtaaac acatcaggtt gattcatcgg 

ggggaataca tacccgcagg cgtagagaca 

aattaatagg agagaaaaac acataaacac 

caccctcccg ctccagaaca acatacagcg 

accagtaaaa aagaaaacct attaaaaaaa 

tcacagtgta aaaaagggcc aagtgcagag 

acggttaaag tccacaaaaa acacccagaa 

aagccaaaaa acccacaact tcctcaaatc 

tcccatttta agaaaactac aattcccaac 

gtcacccgcc ccgttcccac gccccgcgcc 

attggcttca atccaaaata aggtatatta 

<210> 4 
<211> 34448 
<212> DNA 

<213> Adenovirus subgroup C 
<400> 4 

catcatcaat aatatacctt attttggatt 
ttgtgacgtg gcgcggggcg tgggaacggg 
gatgttgcaa gtgtggcgga acacatgtaa 
gtgtgcgccg gtgtacacag gaagtgacaa 
taaatttggg cgtaaccgag taagatttgg 
agtgaaatct gaataatttt gtgttactca 
gactttgacc gtttacgtgg agactcgccc 
cgggtcaaag ttggcgtttt attattatag 
tgagttcctc aagaggccac tcttgagtgc 
tccgacaccg ggactgaaaa tgagacatat 
aatggccgcc agtcttttgg accagctgat 
tcctagccat tttgaaccac ctacccttca 
cgaagatccc aacgaggagg cggtttcgca 
gcaggaaggg attgacttac tcacttttcc 
cctttcccgg cagcccgagc agccggagca 
ccttgtaccg gaggtgatcg atcttacctg 
cgaggatgaa gagggtgagg agtttgtgtt 
caggtcttgt cattatcacc ggaggaatac 
ctatatgagg acctgtggca tgtttgtcta 
tagagtggtg ggtttggtgt ggtaattttt 
gaattttgta ttgtgatttt tttaaaaggt 
ccagaaccgg agcctgcaag acctacccgc 
cgcccgacat cacctgtgtc tagagaatgc 
ccttctaaca cacctcctga gatacacccg 
gccgtgagag ttggtgggcg tcgccaggct 
cctgggcaac ctttggactt gagctgtaaa 
ttgcgtgtgt ggttaacgcc tttgtttgct 
gagataatgt ttaacttgca tggcgtgtta 
cgccgtgggc taatcttggt tacatctgac 
ttttctgctg tgcgtaactt gctggaacag 
tttctgtggg gctcatccca ggcaaagtta 
gaatttgaag agcttttgaa atcctgtggt 
caggcgcttt tccaagagaa ggtcatcaag 
gcggctgctg ttgctttttt gagttttata 
agcggggggt acctgctgga ttttctggcc 
aagaatcgcc tgctactgtt gtcttccgtc 
cagcagcagc aggaggaagc caggcggcgg 
gccggcctgg accctcggga atgaatgttg 
gacgcatttt gacaattaca gaggatgggc 
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ccttcgcagg gccagctgaa cataatcgtg 32580 

cccgccagga accttgacaa aagaacccac 32640 

gctaaccagc gtagccccga tgtaagcttt 32700 

tgctgctcaa aaaatcaggc aaagcctcgc 32760 

catgcagata aaggcaggta agctccggaa 32820 

caaacatgtc tgcgggtttc tgcataaaca 32880 

attagaagcc tgtcttacaa caggaaaaac 32940 

catgccggcg tgaccgtaaa aaaactggtc 33000 

ctcggtcatg tccggagtca taatgtaaga 33060 

tcagtgctaa aaagcgaccg aaatagcccg 33120 

acattacagc ccccatagga ggtataacaa 33180 

ctgaaaaacc ctcctgccta ggcaaaatag 33240 

cttcacagcg gcagcctaac agtcagcctt 33300 

caccactcga cacggcacca gctcaatcag 33360 

cgagtatata taggactaaa aaatgacgta 33420 

aaccgcacgc gaacctacgc ccagaaacga 334 80 

gtcacttccg ttttcccacg ttacgtaact 33540 

acatacaagt tactccgccc taaaacctac 33600 

acgtcacaaa ctccaccccc tcattatcat 33660 

ttgatgatg 33699 



gaagccaata tgataatgag ggggtggagt 60 
gcgggtgacg tagtagtgtg gcggaagtgt 120 
gcgacggatg tggcaaaagt gacgtttttg 180 
ttttcgcgcg gttttaggcg gatgttgtag 240 
ccattttcgc gggaaaactg aataagagga 300 
tagcgcgtaa tatttgtcta gggccgcggg 360 
aggtgttttt ctcaggtgtt ttccgcgttc 420 
tcagctgacg tgtagtgtat ttatacccgg 480 
cagcgagtag agttttctcc tccgagccgc 540 
tatctgccac ggaggtgtta ttaccgaaga 600 
cgaagaggta ctggctgata atcttccacc 660 
cgaactgtat gatttagacg tgacggcccc 720 
gatttttccc gactctgtaa tgttggcggt 780 
gccggcgccc ggttctccgg agccgcctca 840 
gagagccttg ggtccggttt ctatgccaaa 900 
ccacgaggct ggctttccac ccagtgacga 960 
agattatgtg gagcaccccg ggcacggttg 1020 
gggggaccca gatattatgt gttcgctttg 1080 
cagtaagtga aaattatggg cagtgggtga 1140 
tttttaattt ttacagtttt gtggtttaaa 1200 
cctgtgtctg aacctgagcc tgagcccgag 1260 
cgtcctaaaa tggcgcctgc tatcctgaga 1320 
aatagtagta cggatagctg tgactccggt 1380 
gtggtcccgc tgtgccccat taaaccagtt 1440 
gtggaatgta tcgaggactt gcttaacgag 1500 
cgccccaggc cataaggtgt aaacctgtga 1560 
gaatgagttg atgtaagttt aataaagggt 1620 
aatggggcgg ggcttaaagg gtatataatg 1680 
ctcatggagg cttgggagtg tttggaagat 174 0 
agctctaaca gtacctcttg gttttggagg 1800 
gtctgcagaa ttaaggagga ttacaagtgg 1660 
gagctgtttg attctttgaa tctgggtcac 1920 
actttggatt tttccacacc ggggcgcgct 1980 
aaggataaat ggagcgaaga aacccatctg 2040 
atgcatctgt ggagagcggt tgtgagacac 2100 
cgcccggcga taataccgac ggaggagcag 2160 
cggcaggagc agagcccatg gaacccgaga 2220 
tacaggtggc tgaactgtat ccagaactga 2280 
aggggctaaa gggggtaaag agggagcggg 2340 
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gggcttgtga ggctacagag gaggctagga 
gtcctgagtg tattactttt caacagatca 
tggcgcagaa gtattccata gagcagctga 
ttgaggaggc tattagggta tatgcaaagg 
tcagcaaact tgtaaatatc aggaattgtt 
agatagatac ggaggatagg gtggccttta 
tgcttggcat ggacggggtg gttattatga 
gtacggtttt cctggccaat accaacctta 
acaatacctg tgtggaagcc tggaccgatg 
gctggaaggg ggtggtgtgt cgccccaaaa 
aaaggtgtac cttgggtatc ctgtctgagg 
ccgactgtgg ttgcttcatg ctagtgaaaa 
gtggcaactg cgaggacagg gcctctcaga 
tgctgaagac cattcacgta gccagccact 
acatactgac ccgctgttcc ttgcatttgg 
aatgcaattt gagtcacact aagatattgc 
tgaacggggt gtttgacatg accatgaaga 
gcaccaggtg cagaccctgc gagtgtggcg 
tggatgtgac cgaggagctg aggcccgatc 
ttggctctag cgatgaagat acagattgag 
tgggaaagaa tatataaggt gggggtctta 
ccgccgccat gagcaccaac tcgtttgatg 
gcatgccccc atgggccggg gtgcgtcaga 
ccgtcctgcc cgcaaactct actaccttga 
agactgcagc ctccgccgcc gcttcagccg 
actttgcttt cctgagcccg cttgcaagca 
acaagttgac ggctcttttg gcacaattgg 
ctcagcagct gttggatctg cgccagcagg 
atgcggttta aaacataaat aaaaaaccag 
cttgctgtct ttatttaggg gttttgcgcg 
cgttgagggt cctgtgtatt ttttccagga 
acatgggcat aagcccgtct ctggggtgga 
gggtggtgtt gtagatgatc cagtcgtagc 
ctttcagtag caagctgatt gccaggggca 
taagctggga tgggtgcata cgtggggata 
tggctatgtt cccagccata tccctccggg 
tgtatccggt gcacttggga aatttgtcat 
tggagacgcc cttgtgacct ccaagatttt 
gcccacgggc ggcggcctgg gcgaagatat 
ccaggatgag atcgtcatag gccattttta 
gtataatggt tccatccggc ccaggggcgt 
ctttgagttc agatgggggg atcatgtcta 
gggtagggga gatcagctgg gaagaaagca 
cggtgggccc gtaaatcaca cctattaccg 
tgccgtcatc cctgagcagg ggggccactt 
ccctgaccaa atccgccaga aggcgctcgc 
caaagttttt caacggtttg agaccgtccg 
gcagttccag gcggtcccac agctcggtca 
ctcctcgttt cgcgggttgg ggcggctttc 
acgggccagg gtcatgtctt tccacgggcg 
ggtgaagggg tgcgctccgg gctgcgcgct 
ggtgctgaag cgctgccggt cttcgccctg 
gtcatagtcc agcccctccg cggcgtggcc 
gccgcacgag gggcagtgca gacttttgag 
ttccggggag taggcatccg cgccgcaggc 
ggtgagctct ggccgttcgg ggtcaaaaac 
cttacctctg gtttccatga gccggtgtcc 
cccgtataca gacttgagag gcctgtcctc 
aaactcggac cactctgaga caaaggctcg 
ggaggggtag cggtcgttgt ccactagggg 
gtcgccctct tcggcatcaa ggaaggtgat 
tgttcctgaa ggggggctat aaaagggggt 
atcgctgtct gcgagggcca gctgttgggg 
ttctgcgcta agattgtcag tttccaaaaa 
ggtgatgcct ttgagggtgg ccgcatccat 
aagcttggtg gcaaacgacc cgtagagggc 



atctagcttt tagcttaatg accagacacc 2400 
aggataattg cgctaatgag cttgatctgc 24 60 
ccacttactg gctgcagcca ggggatgatt 2520 
tggcacttag gccagattgc aagtacaaga 2580 
gctacatttc tgggaacggg gccgaggtgg 2640 
gatgtagcat gataaatatg tggccggggg 2700 
atgtaaggtt tactggcccc aattttagcg 2760 
tcctacacgg tgtaagcttc tatgggttta 2820 
taagggttcg gggctgtgcc ttttactgct 2880 
gcagggcttc aattaagaaa tgcctctttg 2940 
gtaactccag ggtgcgccac aatgtggcct 3000 
gcgtggctgt gattaagcat aacatggtat 3060 
tgctgacctg ctcggacggc aactgtcacc 3120 
ctcgcaaggc ctggccagtg tttgagcata 3180 
gtaacaggag gggggtgttc ctaccttacc 3240 
ttgagcccga gagcatgtcc aaggtgaacc 3300 
tctggaaggt gctgaggtac gatgagaccc 3360 
gtaaacatat taggaaccag cctgtgatgc 3420 
acttggtgct ggcctgcacc cgcgctgagt 3480 
gtactgaaat gtgtgggcgt ggcttaaggg 3540 
tgtagttttg tatctgtttt gcagcagccg 3600 
gaagcattgt gagctcatat ttgacaacgc 3660 
atgtgatggg ctccagcatt gatggtcgcc 3720 
cctacgagac cgtgtctgga acgccgttgg 3780 
ctgcagccac cgcccgcggg attgtgactg 3840 
gtgcagcttc ccgttcatcc gcccgcgatg 3900 
attctttgac ccgggaactt aatgtcgttt 3960 
tttctgccct gaaggcttcc tcccctccca 4020 
actctgtttg gatttggatc aagcaagtgt 4080 
cgcggtaggc ccgggaccag cggtctcggt 4140 
cgtggtaaag gtgactctgg atgttcagat 4200 
ggtagcacca ctgcagagct tcatgctgcg 4260 
aggagcgctg ggcgtggtgc ctaaaaatgt 4320 
ggcccttggt gtaagtgttt acaaagcggt 4380 
tgagatgcat cttggactgt atttttaggt 4440 
gattcatgtt gtgcagaacc accagcacag 4500 
gtagcttaga aggaaatgcg tggaagaact 4560 
ccatgcattc gtccataatg atggcaatgg 4620 
ttctgggatc actaacgtca tagttgtgtt 4680 
caaagcgcgg gcggagggtg ccagactgcg 4740 
agttaccctc acagatttgc atttcccacg 4800 
cctgcggggc gatgaagaaa acggtttccg 4860 
ggttcctgag cagctgcgac ttaccgcagc 4920 
ggtgcaactg gtagttaaga gagctgcagc 4980 
cgttaagcat gtccctgact cgcatgtttt 5040 
cgcccagcga tagcagttct tgcaaggaag 5100 
ccgtaggcat gcttttgagc gtttgaccaa 5160 
cctgctctac ggcatctcga tccagcatat 5220 
gctgtacggc agtagtcggt gctcgtccag 5280 
cagggtcctc gtcagcgtag tctgggtcac 5340 
ggccagggtg cgcttgaggc tggtcctgct 5400 
cgcgtcggcc aggtagcatt tgaccatggt 5460 
cttggcgcgc agcttgccct tggaggaggc 5520 
ggcgtagagc ttgggcgcga gaaataccga 5580 
cccgcagacg gtctcgcatt ccacgagcca 5640 
caggtttccc ccatgctttt tgatgcgttt 5700 
acgctcggtg acgaaaaggc tgtccgtgtc 5760 
gagcggtgtt ccgcggtcct cctcgtatag 5820 
cgtccaggcc agcacgaagg aggctaagtg 5880 
gtccactcgc tccagggtgt gaagacacat 5940 
tggtttgtag gtgtaggcca cgtgaccggg 6000 
gggggcgcgt tcgtcctcac tctcttccgc 6060 
tgagtactcc ctctgaaaag cgggcatgac 6120 
cgaggaggat ttgatattca cctggcccgc 6180 
ctggtcagaa aagacaatct ttttgttgtc 6240 
gttggacagc aacttggcga tggagcgcag 6300 
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ggtttggttt ttgtcgcgat cggcgcgctc cttggccgcg atgtttagct gcacgtattc 6360 
gcgcgcaacg caccgccatt cgggaaagac ggtggtgcgc tcgtcgggca ccaggtgcac 6420 
gcgccaaccg cggttgtgca gggtgacaag gtcaacgctg gtggctacct ctccgcgtag 6480 
gcgctcgttg gtccagcaga ggcggccgcc cttgcgcgag cagaatggcg gtagggggtc 6540 
tagctgcgtc tcgtccgggg ggtctgcgtc cacggtaaag accccgggca gcaggcgcgc 6600 
gtcgaagtag tctatcttgc atccttgcaa gtctagcgcc tgctgccatg cgcgggcggc 6660 
aagcgcgcgc tcgtatgggt tgagtggggg accccatggc atggggtggg tgagcgcgga 6720 
ggcgtacatg ccgcaaatgt cgtaaacgta gaggggctct ctgagtattc caagatatgt 6780 
agggtagcat cttccaccgc ggatgctggc gcgcacgtaa tcgtatagtt cgtgcgaggg 6840 
agcgaggagg tcgggaccga ggttgctacg ggcgggctgc tctgctcgga agactatctg 6900 
cctgaagatg gcatgtgagt tggatgatat ggttggacgc tggaagacgt tgaagctggc 6960 
gtctgtgaga cctaccgcgt cacgcacgaa ggaggcgtag gagtcgcgca gcttgttgac 7020 
cagctcggcg gtgacctgca cgtctagggc gcagtagtcc agggtttcct tgatgatgtc 7080 
atacttatcc tgtccctttt ttttccacag ctcgcggttg aggacaaact cttcgcggtc 7140 
tttccagtac tcttggatcg gaaacccgtc ggcctccgaa cggtaagagc ctagcatgta 7200 
gaactggttg acggcctggt aggcgcagca tcccttttct acgggtagcg cgtatgcctg 7260 
cgcggccttc cggagcgagg tgtgggtgag cgcaaaggtg tccctgacca tgactttgag 7320 
gtactggtat ttgaagtcag tgtcgtcgca tccgccctgc tcccagagca aaaagtccgt 7380 
gcgctttttg gaacgcggat ttggcagggc gaaggtgaca tcgttgaaga gtatctttcc 7440 
cgcgcgaggc ataaagttgc gtgtgatgcg gaagggtccc ggcacctcgg aacggttgtt 7500 
aattacctgg gcggcgagca cgatctcgtc aaagccgttg atgttgtggc ccacaatgta 7560 
aagttccaag aagcgcggga tgcccttgat ggaaggcaat tttttaagtt cctcgtaggt 7620 
gagctcttca ggggagctga gcccgtgctc tgaaagggcc cagtctgcaa gatgagggtt 7680 
ggaagcgacg aatgagctcc acaggtcacg ggccattagc atttgcaggt ggtcgcgaaa 7740 
ggtcctaaac tggcgaccta tggccatttt ttctggggtg atgcagtaga aggtaagcgg 7800 
gtcttgttcc cagcggtccc atccaaggtt cgcggctagg tctcgcgcgg cagtcactag 7860 
aggctcatct ccgccgaact tcatgaccag catgaagggc acgagctgct tcccaaaggc 7920 
ccccatccaa gtataggtct ctacatcgta ggtgacaaag agacgctcgg tgcgaggatg 7980 
cgagccgatc gggaagaact ggatctcccg ccaccaattg gaggagtggc tattgatgtg 8040 
gtgaaagtag aagtccctgc gacgggccga acactcgtgc tggcttttgt aaaaacgtgc 8100 
gcagtactgg cagcggtgca cgggctgtac atcctgcacg aggttgacct gacgaccgcg 8160 
cacaaggaag cagagtggga atttgagccc ctcgcctggc gggtttggct ggtggtcttc 8220 
tacttcggct gcttgtcctt gaccgtctgg ctgctcgagg ggagttacgg tggatcggac 8280 
caccacgccg cgcgagccca aagtccagat gtccgcgcgc ggcggtcgga gcttgatgac 8340 
aacatcgcgc agatgggagc tgtccatggt ctggagctcc cgcggcgtca ggtcaggcgg 8400 
gagctcctgc aggtttacct cgcatagacg ggtcagggcg cgggctagat ccaggtgata 8460 
cctaatttcc aggggctggt tggtggcggc gtcgatggct tgcaagaggc cgcatccccg 8520 
cggcgcgact acggtaccgc gcggcgggcg gtgggccgcg ggggtgtcct tggatgatgc 8580 
atctaaaagc ggtgacgcgg gcgagccccc ggaggtaggg ggggctccgg acccgccggg 8640 
agagggggca ggggcacgtc ggcgccgcgc gcgggcagga gctggtgctg cgcgcgtagg 8700 
ttgctggcga acgcgacgac gcggcggttg atctcctgaa tctggcgcct ctgcgtgaag 8760 
acgacgggcc cggtgagctt gagcctgaaa gagagttcga cagaatcaat ttcggtgtcg 8820 
ttgacggcgg cctggcgcaa aatctcctgc acgtctcctg agttgtcttg ataggcgatc 8880 
tcggccatga actgctcgat ctcttcctcc tggagatctc cgcgtccggc tcgctccacg 8940 
gtggcggcga ggtcgttgga aatgcgggcc atgagctgcg agaaggcgtt gaggcctccc 9000 
tcgttccaga cgcggctgta gaccacgccc ccttcggcat cgcgggcgcg catgaccacc 9060 
tgcgcgagat tgagctccac gtgccgggcg aagacggcgt agtttcgcag gcgctgaaag 9120 
aggtagttga gggtggtggc ggtgtgttct gccacgaaga agtacataac ccagcgtcgc 9180 
aacgtggatt cgttgatatc ccccaaggcc tcaaggcgct ccatggcctc gtagaagtcc 9240 
acggcgaagt tgaaaaactg ggagttgcgc gccgacacgg ttaactcctc ctccagaaga 9300 
cggatgagct cggcgacagt gtcgcgcacc tcgcgctcaa aggctacagg ggcctcttct 9360 
tcttcttcaa tctcctcttc cataagggcc tccccttctt cttcttctgg cggcggtggg 9420 
ggagggggga cacggcggcg acgacggcgc accgggaggc ggtcgacaaa gcgctcgatc 9480 
atctccccgc ggcgacggcg catggtctcg gtgacggcgc ggccgttctc gcgggggcgc 9540 
agttggaaga cgccgcccgt catgtcccgg ttatgggttg gcggggggct gccatgcggc 9600 
agggatacgg cgctaacgat gcatctcaac aattgttgtg taggtactcc gccgccgagg 9660 
gacctgagcg agtccgcatc gaccggatcg gaaaacctct cgagaaaggc gtctaaccag 9720 
tcacagtcgc aaggtaggct gagcaccgtg gcgggcggca gcgggcggcg gtcggggttg 9780 
tttctggcgg aggtgctgct gatgatgtaa ttaaagtagg cggtcttgag acggcggatg 9840 
gtcgacagaa gcaccatgtc cttgggtccg gcctgctgaa tgcgcaggcg gtcggccatg 9900 
ccccaggctt cgttttgaca tcggcgcagg tctttgtagt agtcttgcat gagcctttct 9960 
accggcactt cttcttctcc ttcctcttgt cctgcatctc ttgcatctat cgctgcggcg 10020 
gcggcggagt ttggccgtag gtggcgccct cttcctccca tgcgtgtgac cccgaagccc 10080 
ctcatcggct gaagcagggc taggtcggcg acaacgcgct cggctaatat ggcctgctgc 10140 
acctgcgtga gggtagactg gaagtcatcc atgtccacaa agcggtggta tgcgcccgtg 10200 
ttgatggtgt aagtgcagtt ggccataacg gaccagttaa cggtctggtg acccggctgc 10260 
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gagagctcgg tgtacctgag acgcgagtaa gccctcgagt caaatacgta gtcgttgcaa 10320 
gtccgcacca ggtactggta tcccaccaaa aagtgcggcg gcggctggcg gtagaggggc 10380 
cagcgtaggg tggccggggc tccgggggcg agatcttcca acataaggcg atgatatccg 104 40 
tagatgtacc tggacatcca ggtgatgccg gcggcggtgg tggaggcgcg cggaaagtcg 10500 
cggacgcggt tccagatgtt gcgcagcggc aaaaagtgct ccatggtcgg gacgctctgg 10560 
ccggtcaggc gcgcgcaatc gttgacgctc tagaccgtgc aaaaggagag cctgtaagcg 10620 
ggcactcttc cgtggtctgg tggataaatt cgcaagggta tcatggcgga cgaccggggt 10680 
tcgagccccg tatccggccg tccgccgtga tccatgcggt taccgcccgc gtgtcgaacc 10740 
caggtgtgcg acgtcagaca acgggggagt gctccttttg gcttccttcc aggcgcggcg 10800 
gctgctgcgc tagctttttt ggccactggc cgcgcgcagc gtaagcggtt aggctggaaa 10860 
gcgaaagcat taagtggctc gctccctgta gccggagggt tattttccaa gggttgagtc 10920 
gcgggacccc cggttcgagt ctcggaccgg ccggactgcg gcgaacgggg gtttgcctcc 10980 
ccgtcatgca agaccccgct tgcaaattcc tccggaaaca gggacgagcc ccttttttgc 11040 
ttttcccaga tgcatccggt gctgcggcag atgcgccccc ctcctcagca gcggcaagag 11100 
caagagcagc ggcagacatg cagggcaccc tcccctcctc ctaccgcgtc aggaggggcg 11160 
acatccgcgg ttgacgcggc agcagatggt gattacgaac ccccgcggcg ccgggcccgg 11220 
cactacctgg acttggagga gggcgagggc ctggcgcggc taggagcgcc ctctcctgag 11280 
cggtacccaa gggtgcagct gaagcgtgat acgcgtgagg cgtacgtgcc gcggcagaac 11340 
ctgtttcgcg accgcgaggg agaggagccc gaggagatgc gggatcgaaa gttccacgca 11400 
gggcgcgagc tgcggcatgg cctgaatcgc gagcggttgc tgcgcgagga ggactttgag 11460 
cccgacgcgc gaaccgggat tagtcccgcg cgcgcacacg tggcggccgc cgacctggta 11520 
accgcatacg agcagacggt gaaccaggag attaactttc aaaaaagctt taacaaccac 11580 
gtgcgtacgc ttgtggcgcg cgaggaggtg gctataggac tgatgcatct gtgggacttt 11640 
gtaagcgcgc tggagcaaaa cccaaatagc aagccgctca tggcgcagct gttccttata 11700 
gtgcagcaca gcagggacaa cgaggcattc agggatgcgc tgctaaacat agtagagccc 11760 
gagggccgct ggctgctcga tttgataaac atcctgcaga gcatagtggt gcaggagcgc 11820 
agcttgagcc tggctgacaa ggtggccgcc atcaactatt ccatgcttag cctgggcaag 118JB.0 . 
ttttacgccc gcaagatata ccatacccct tacgttccca tagacaagga ggtaaagatc 11940 
gaggggttct acatgcgcat ggcgctgaag gtgcttacct tgagcgacga cctgggcgtt 12000 
tatcgcaacg agcgcatcca caaggccgtg agcgtgagcc ggcggcgcga gctcagcgac 12060 
cgcgagctga tgcacagcct gcaaagggcc ctggctggca cgggcagcgg cgatagagag 12120 
gccgagtcct actttgacgc gggcgctgac ctgcgctggg ccccaagccg acgcgccctg 12180 
gaggcagctg gggccggacc tgggctggcg gtggcacccg cgcgcgctgg caacgtcggc 12240 
ggcgtggagg aatatgacga ggacgatgag tacgagccag aggacggcga gtactaagcg 12300 
gtgatgtttc tgatcagatg atgcaagacg caacggaccc ggcggtgcgg gcggcgctgc 12360 
agagccagcc gtccggcctt aactccacgg acgactggcg ccaggtcatg gaccgcatca 12420 
tgtcgctgac tgcgcgcaat cctgacgcgt tccggcagca gccgcaggcc aaccggctct 12480 
ccgcaattct ggaagcggtg gtcccggcgc gcgcaaaccc cacgcacgag aaggtgctgg 12540 
cgatcgtaaa cgcgctggcc gaaaacaggg ccatccggcc cgacgaggcc ggcctggtct 12600 
acgacgcgct gcttcagcgc gtggctcgtt acaacagcgg caacgtgcag accaacctgg 12660 
accggctggt gggggatgtg cgcgaggccg tggcgcagcg tgagcgcgcg cagcagcagg 12720 
gcaacctggg ctccatggtt gcactaaacg ccttcctgag tacacagccc gccaacgtgc 12780 
cgcggggaca ggaggactac accaactttg tgagcgcact gcggctaatg gtgactgaga 12840 
caccgcaaag tgaggtgtac cagtctgggc cagactattt tttccagacc agtagacaag 12900 
gcctgcagac cgtaaacctg agccaggctt tcaaaaactt gcaggggctg tggggggtgc 12960 
gggctcccac aggcgaccgc gcgaccgtgt ctagcttgct gacgcccaac tcgcgcctgt 13020 
tgctgctgct aatagcgccc ttcacggaca gtggcagcgt gtcccgggac acatacctag 13080 
gtcacttgct gacactgtac cgcgaggcca taggtcaggc gcatgtggac gagcatactt 13140 
tccaggagat tacaagtgtc agccgcgcgc tggggcagga ggacacgggc agcctggagg 13200 
caaccctaaa ctacctgctg accaaccggc ggcagaagat cccctcgttg cacagtttaa 13260 
acagcgagga ggagcgcatt ttgcgctacg tgcagcagag cgtgagcctt aacctgatgc 13320 
gcgacggggt aacgcccagc gtggcgctgg acatgaccgc gcgcaacatg gaaccgggca 13380 
tgtatgcctc aaaccggccg tttatcaacc gcctaatgga ctacttgcat cgcgcggccg 13440 
ccgtgaaccc cgagtatttc accaatgcca tcttgaaccc gcactggcta ccgccccctg 13500 
gtttctacac cgggggattc gaggtgcccg agggtaacga tggattcctc tgggacgaca 13560 
tagacgacag cgtgttttcc ccgcaaccgc agaccctgct agagttgcaa cagcgcgagc 13620 
aggcagaggc ggcgctgcga aaggaaagct tccgcaggcc aagcagcttg tccgatctag 13680 
gcgctgcggc cccgcggtca gatgctagta gcccatttcc aagcttgata gggtctctta 13740 
ccagcactcg caccacccgc ccgcgcctgc tgggcgagga ggagtaccta aacaactcgc 13800 
tgctgcagcc gcagcgcgaa aaaaacctgc ctccggcatt tcccaacaac gggatagaga 13860 
gcctagtgga caagatgagt agatggaaga cgtacgcgca ggagcacagg gacgtgccag 13920 
gcccgcgccc gcccacccgt cgtcaaaggc acgaccgtca gcggggtctg gtgtgggagg 13980 
acgatgactc ggcagacgac agcagcgtcc tggatttggg agggagtggc aacccgtttg 14040 
cgcaccttcg ccccaggctg gggagaatgt tttaaaaaaa aaaaagcatg atgcaaaata 14100 
aaaaactcac caaggccatg gcaccgagcg ttggttttct tgtattcccc ttagtatgcg 14160 
gcgcgcggcg atgtatgagg aaggtcctcc tccctcctac gagagtgtgg tgagcgcggc 14220 
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gccagtggcg gcggcgctgg gttctccctt cgatgctccc ctggacccgc cgtttgtgcc 14280 
tccgcggtac ctgcggccta ccggggggag aaacagcatc cgttactctg agttggcacc 14340 
cctattcgac accacccgtg tgtacctggt ggacaacaag tcaacggatg tggcatccct 14400 
gaactaccag aacgaccaca gcaactttct gaccacggtc attcaaaaca atgactacag 14 4 60 
cccgggggag gcaagcacac agaccatcaa tcttgacgac cggtcgcact ggggcggcga 14520 
cctgaaaacc atcctgcata ccaacatgcc aaatgtgaac gagttcatgt ttaccaataa 14580 
gtttaaggcg cgggtgatgg tgtcgcgctt gcctactaag gacaatcagg tggagctgaa 14640 
atacgagtgg gtggagttca cgctgcccga gggcaactac tccgagacca tgaccataga 14700 
ccttatgaac aacgcgatcg tggagcacta cttgaaagtg ggcagacaga acggggttct 14760 
ggaaagcgac atcggggtaa agtttgacac ccgcaacttc agactggggt ttgaccccgt 14820 
cactggtctt gtcatgcctg gggtatatac aaacgaagcc ttccatccag acatcatttt 14880 
gctgccagga tgcggggtgg acttcaccca cagccgcctg agcaacttgt tgggcatccg 14940 
caagcggcaa cccttccagg agggctttag gatcacctac gatgatctgg agggtggtaa 15000 
cattcccgca ctgttggatg tggacgccta ccaggcgagc ttgaaagatg acaccgaaca 15060 
gggcgggggt ggcgcaggcg gcagcaacag cagtggcagc ggcgcggaag agaactccaa 15120 
cgcggcagcc gcggcaatgc agccggtgga ggacatgaac gatcatgcca ttcgcggcga 15180 
cacctttgcc acacgggctg aggagaagcg cgctgaggcc gaagcagcgg ccgaagctgc 15240 
cgcccccgct gcgcaacccg aggtcgagaa gcctcagaag aaaccggtga tcaaacccct 15300 
gacagaggac agcaagaaac gcagttacaa cctaataagc aatgacagca ccttcaccca 15360 
gtaccgcagc tggtaccttg catacaacta cggcgaccct cagaccggaa tccgctcatg 15420 
gaccctgctt tgcactcctg acgtaacctg cggctcggag caggtctact ggtcgttgcc 15480 
agacatgatg caagaccccg tgaccttccg ctccacgcgc cagatcagca actttccggt 15540 
ggtgggcgcc gagctgttgc ccgtgcactc caagagcttc tacaacgacc aggccgtcta 15600 
ctcccaactc atccgccagt ttacctctct gacccacgtg ttcaatcgct ttcccgagaa 15660 
ccagattttg gcgcgcccgc cagcccccac catcaccacc gtcagtgaaa acgttcctgc 15720 
tctcacagat cacgggacgc taccgctgcg caacagcatc ggaggagtcc agcgagtgac 15780 
cattactgac gccagacgcc gcacctgccc ctacgtttac aaggccctgg gcatagtctc 15840 
gccgcgcgtc ctatcgagcc gcactttttg agcaagcatg tccatcctta tatcgcccag 15900 
caataacaca ggctggggcc tgcgcttccc aagcaagatg tttggcgggg ccaagaagcg 15960 
ctccgaccaa cacccagtgc gcgtgcgcgg gcactaccgc gcgccctggg gcgcgcacaa 16020 
acgcggccgc actgggcgca ccaccgtcga tgacgccatc gacgcggtgg tggaggaggc 16080 
gcgcaactac acgcccacgc cgccaccagt gtccacagtg gacgcggcca ttcagaccgt 16140 
ggtgcgcgga gcccggcgct atgctaaaat gaagagacgg cggaggcgcg tagcacgtcg 16200 
ccaccgccgc cgacccggca ctgccgccca acgcgcggcg gcggccctgc ttaaccgcgc 16260 
acgtcgcacc ggccgacggg cggccatgcg ggccgctcga aggctggccg cgggtattgt 16320 
cactgtgccc cccaggtcca ggcgacgagc ggccgccgca gcagccgcgg ccattagtgc 16380 
tatgactcag ggtcgcaggg gcaacgtgta ttgggtgcgc gactcggtta gcggcctgcg 16440 
cgtgcccgtg cgcacccgcc ccccgcgcaa ctagattgca agaaaaaact acttagactc 16500 
gtactgttgt atgtatccag cggcggcggc gcgcaacgaa gctatgtcca agcgcaaaat 16560 
caaagaagag atgctccagg tcatcgcgcc ggagatctat ggccccccga agaaggaaga 16620 
gcaggattac aagccccgaa agctaaagcg ggtcaaaaag aaaaagaaag atgatgatga 16680 
tgaacttgac gacgaggtgg aactgctgca cgctaccgcg cccaggcgac gggtacagtg 16740 
gaaaggtcga cgcgtaaaac gtgttttgcg acccggcacc accgtagtct ttacgcccgg 16800 
tgagcgctcc acccgcacct acaagcgcgt gtatgatgag gtgtacggcg acgaggacct 16860 
gcttgagcag gccaacgagc gcctcgggga gtttgcctac ggaaagcggc ataaggacat 16920 
gctggcgttg ccgctggacg agggcaaccc aacacctagc ctaaagcccg taacactgca 16980 
gcaggtgctg cccgcgcttg caccgtccga agaaaagcgc ggcctaaagc gcgagtctgg 17040 
tgacttggca cccaccgtgc agctgatggt acccaagcgc cagcgactgg aagatgtctt 17100 
ggaaaaaatg accgtggaac ctgggctgga gcccgaggtc cgcgtgcggc caatcaagca 17160 
ggtggcgccg ggactgggcg tgcagaccgt ggacgttcag atacccacta ccagtagcac 17220 
cagtattgcc accgccacag agggcatgga gacacaaacg tccccggttg cctcagcggt 17280 
ggcggatgcc gcggtgcagg cggtcgctgc ggccgcgtcc aagacctcta cggaggtgca 17340 
aacggacccg tggatgtttc gcgtttcagc cccccggcgc ccgcgcggtt cgaggaagta 17400 
cggcgccgcc agcgcgctac tgcccgaata tgccctacat ccttccattg cgcctacccc 17460 
cggctatcgt ggctacacct accgccccag aagacgagca actacccgac gccgaaccac 17520 
cactggaacc cgccgccgcc gtcgccgtcg ccagcccgtg ctggccccga tttccgtgcg 17580 
cagggtggct cgcgaaggag gcaggaccct ggtgctgcca acagcgcgct accaccccag 17640 
catcgtttaa aagccggtct ttgtggttct tgcagatatg gccctcacct gccgcctccg 17700 
tttcccggtg ccgggattcc gaggaagaat gcaccgtagg aggggcatgg ccggccacgg 17760 
cctgacgggc ggcatgcgtc gtgcgcacca ccggcggcgg cgcgcgtcgc accgtcgcat 17820 
gcgcggcggt atcctgcccc tccttattcc actgatcgcc gcggcgattg gcgccgtgcc 17880 
cggaattgca tccgtggcct tgcaggcgca gagacactga ttaaaaacaa gttgcatgtg 17940 
gaaaaatcaa aataaaaagt ctggactctc acgctcgctt ggtcctgtaa ctattttgta 18000 
gaatggaaga catcaacttt gcgtctctgg ccccgcgaca cggctcgcgc ccgttcatgg 18060 
gaaactggca agatatcggc accagcaata tgagcggtgg cgccttcagc tggggctcgc 18120 
tgtggagcgg cattaaaaat ttcggttcca ccgttaagaa ctatggcagc aaggcctgga 18180 
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acagcagcac aggccagatg ctgagggata agttgaaaga gcaaaatttc caacaaaagg 1824 0 

tggtagatgg cctggcctct ggcattagcg gggtggtgga cctggccaac caggcagtgc 18300 

aaaataagat taacagtaag cttgatcccc gccctcccgt agaggagcct ccaccggccg 18360 

tggagacagt gtctccagag gggcgtggcg aaaagcgtcc gcgccccgac agggaagaaa 18420 

ctctggtgac gcaaatagac gagcctccct cgtacgagga ggcactaaag caaggcctgc 18480 

ccaccacccg tcccatcgcg cccatggcta ccggagtgct gggccagcac acacccgtaa 18540 

cgctggacct gcctcccccc gccgacaccc agcagaaacc tgtgctgcca ggcccgaccg 18600 

ccgttgttgt aacccgtcct agccgcgcgt ccctgcgccg cgccgccagc ggtccgcgat 18660 

cgttgcggcc cgtagccagt ggcaactggc aaagcacact gaacagcatc gtgggtctgg 18720 

gggtgcaatc cctgaagcgc cgacgatgct tctgaatagc taacgtgtcg tatgtgtgtc 18780 

atgtatgcgt ccatgtcgcc gccagaggag ctgctgagcc gccgcgcgcc cgctttqcaa 18840 

gatggctacc ccttcgatga tgccgcagtg gtcttacatg cacatctcgg gccaggacgc 18900 

ctcggagtac ctgagccccg ggctggtgca gtttgcccgc gccaccgaga cgtacttcag 18960 

cctgaataac aagtttagaa accccacggt ggcgcctacg cacgacgtga ccacagaccg 19020 

gtcccagcgt ttgacgctgc ggttcatccc tgtggaccgt gaggatactg cgtactcgta 19080 

caaggcgcgg ttcaccctag ctgtgggtga taaccgtgtg ctggacatgg cttccacgta 19140 

ctttgacatc cgcggcgtgc tggacagggg ccctactttt aagccctact ctggcactgc 19200 

ctacaacgcc ctggctccca agggtgcccc aaatccttgc gaatgggatg aagctgctac 19260 

tgctcttgaa ataaacctag aagaagagga cgatgacaac gaagacgaag tagacgagca 19320 

agctgagcag caaaaaactc acgtatttgg gcaggcgcct tattctggta taaatattac 19380 

aaaggagggt attcaaatag gtgtcgaagg tcaaacacct aaatatgccg ataaaacatt 19440 

tcaacctgaa cctcaaatag gagaatctca gtggtacgaa actgaaatta atcatgcagc 19500 

tgggagagtc cttaaaaaga ctaccccaat gaaaccatgt tacggttcat atgcaaaacc 19560 

cacaaatgaa aatggagggc aaggcattct tgtaaagcaa caaaatggaa agctagaaag 19620 

tcaagtggaa atgcaatttt tctcaactac tgaggcgacc gcaggcaatg gtgataactt 19680 

gactcctaaa gtggtattgt acagtgaaga tgtagatata gaaaccccag acactcatat 19740 

ttcttacatg cccactatta aggaaggtaa ctcacgagaa ctaatgggcc aacaatctat 19800 

gcccaacagg cctaattaca ttgcttttag ggacaatttt attggtctaa tgtattacaa 19860 

cagcacgggt aatatgggtg ttctggcggg ccaagcatcg cagttgaatg ctgttgtaga 19920 

tttgcaagac agaaacacag agctttcata ccagcttttg cttgattcca ttggtgatag 19980 

aaccaggtac ttttctatgt ggaatcaggc tgttgacagc tatgatccag atgttagaat 20040 

tattgaaaat catggaactg aagatgaact tccaaattac tgctttccac tgggaggtgt 20100 

gattaataca gagactctta ccaaggtaaa acctaaaaca ggtcaggaaa atggatggga 20160 

aaaagatgct acagaatttt cagataaaaa tgaaataaga gttggaaata attttgccat 20220 

ggaaatcaat ctaaatgcca acctgtggag aaatttcctg tactccaaca tagcgctgta 20280 

tttgcccgac aagctaaagt acagtccttc caacgtaaaa atttctgata acccaaacac 20340 

ctacgactac atgaacaagc gagtggtggc tcccgggtta gtggactgct acattaacct 20400 

tggagcacgc tggtcccttg actatatgga caacgtcaac ccatttaacc accaccgcaa 204 60 

tgctggcctg cgctaccgct caatgttgct gggcaatggt cgctatgtgc ccttccacat 20520 

ccaggtgcct cagaagttct ttgccattaa aaacctcctt ctcctgccgg gctcatacac 20580 

ctacgagtgg aacttcagga aggatgttaa catggttctg cagagctccc taggaaatga 20640 

cctaagggtt gacggagcca gcattaagtt tgatagcatt tgcctttacg ccaccttctt 20700 

ccccatggcc cacaacaccg cctccacgct tgaggccatg cttagaaacg acaccaacga 20760 

ccagtccttt aacgactatc tctccgccgc caacatgctc taccctatac ccgccaacgc 20820 

taccaacgtg cccatatcca tcccctcccg caactgggcg gctttccgcg gctgggcctt 20880 

cacgcgcctt aagactaagg aaaccccatc actgggctcg ggctacgacc cttattacac 20940 

ctactctggc tctataccct acctagatgg aaccttttac ctcaaccaca cctttaagaa 21000 

ggtggccatt acctttgact cttctgtcag ctggcctggc aatgaccgcc tgcttacccc 21060 

caacgagttt gaaattaagc gctcagttga cggggagggt tacaacgttg cccagtgtaa 21120 

catgaccaaa gactggttcc tggtacaaat gctagctaac tacaacattg gctaccaggg 21180 

cttctatatc ccagagagct acaaggaccg catgtactcc ttctttagaa acttccagcc 21240 

catgagccgt caggtggtgg atgatactaa atacaaggac taccaacagg tgggcatcct 21300 

acaccaacac aacaactctg gatttgttgg ctaccttgcc cccaccatgc gcgaaggaca 21360 

ggcctaccct gctaacttcc cctatccgct tataggcaag accgcagttg acagcattac 21420 

ccagaaaaag tttctttgcg atcgcaccct ttggcgcatc ccattctcca gtaactttat 21480 

gtccatgggc gcactcacag acctgggcca aaaccttctc tacgccaact ccgcccacgc 21540 

gctagacatg acttttgagg tggatcccat ggacgagccc acccttcttt atgttttgtt 21600 

tgaagtcttt gacgtggtcc gtgtgcaccg gccgcaccgc ggcgtcatcg aaaccgtgta 21660 

cctgcgcacg cccttctcgg ccggcaacgc cacaacataa agaagcaagc aacatcaaca 21720 

acagctgccg ccatgggctc cagtgagcag gaactgaaag ccattgtcaa agatcttggt 21780 

tgtgggccat attttttggg cacctatgac aagcgctttc caggctttgt ttctccacac 21840 

aagctcgcct gcgccatagt caatacggcc ggtcgcgaga ctgggggcgt acactggatg 21900 

gcctttgcct ggaacccgca ctcaaaaaca tgctacctct ttgagccctt tggcttttct 21960 

gaccagcgac tcaagcaggt ttaccagttt gagtacgagt cactcctgcg ccgtagcgcc 22020 

attgcttctt cccccgaccg ctgtataacg ctggaaaagt ccacccaaag cgtacagggg 22080 

cccaactcgg ccgcctgtgg actattctgc tgcatgtttc tccacgcctt tgccaactgg 22140 



WO 01/04282 



33 



PCT/US00/18971 



ccccaaactc ccatggatca caaccccacc atgaacctta ttaccggggt acccaactcc 22200 
atgctcaaca gtccccaggt acagcccacc ctgcgtcgca accaggaaca gctctacagc 22260 
ttcctggagc gccactcgcc ctacttccgc agccacagtg cgcagattag gagcgccact 22320 
tctttttgtc acttgaaaaa catgtaaaaa taatgtacta gagacacttt caataaaggc 22380 
aaatgctttt atttgtacac tctcgggtga ttatttaccc ccacccttgc cgtctgcgcc 22440 
gtttaaaaat caaaggggtt ctgccgcgca tcgctatgcg ccactggcag ggacacgttg 22500 
cgatactggt gtttagtgct ccacttaaac tcaggcacaa ccatccgcgg cagctcggtg 22560 
aagttttcac tccacaggct gcgcaccatc accaacgcgt ttagcaggtc gggcgccgat 22620 
atcttgaagt cgcagttggg gcctccgccc tgcgcgcgcg agttgcgata cacagggttg 22680 
cagcactgga acactatcag cgccgggtgg tgcacgctgg ccagcacgct cttgtcggag 22740 
atcagatccg cgtccaggtc ctccgcgttg ctcagggcga acggagtcaa ctttggtagc 22800 
tgccttccca aaaagggcgc gtgcccaggc tttgagttgc actcgcaccg tagtggcatc 22860 
aaaaggtgac cgtgcccggt ctgggcgtta ggatacagcg cctgcataaa agccttgatc 22920 
tgcttaaaag ccacctgagc ctttgcgcct tcagagaaga acatgccgca agacttgccg 22980 
gaaaactgat tggccggaca ggccgcgtcg tgcacgcagc accttgcgtc ggtgttggag 23040 
atctgcacca catttcggcc ccaccggttc ttcacgatct tggccttgct agactgctcc 23100 
ttcagcgcgc gctgcccgtt ttcgctcgtc acatccattt caatcacgtg ctccttattt 23160 
atcataatgc ttccgtgtag acacttaagc tcgccttcga tctcagcgca gcggtgcagc 23220 
cacaacgcgc agcccgtggg ctcgtgatgc ttgtaggtca cctctgcaaa cgactgcagg 23280 
tacgcctgca ggaatcgccc catcatcgtc acaaaggtct tgttgctggt gaaggtcagc 23340 
tgcaacccgc ggtgctcctc gttcagccag gtcttgcata cggccgccag agcttccact 23400 
tggtcaggca gtagtttgaa gttcgccttt agatcgttat ccacgtggta cttgtccatc 234 60 
agcgcgcgcg cagcctccat gcccttctcc cacgcagaca cgatcggcac actcagcggg 23520 
ttcatcaccg taatttcact ttccgcttcg ctgggctctt cctcttcctc ttgcgtccgc 23580 
ataccacgcg ccactgggtc gtcttcattc agccgccgca ctgtgcgctt acctcctttg 23640 
ccatgcttga ttagcaccgg tgggttgctg aaacccacca tttgtagcgc cacatcttct 23700 
ctttcttcct cgctgtccac gattacctct ggtgatggcg ggcgctcggg cttgggagaa 23760 
gggcgcttct ttttcttctt gggcgcaatg gccaaatccg ccgccgaggt cgatggccgc 23820 
gggctgggtg tgcgcggcac cagcgcgtct tgtgatgagt cttcctcgtc ctcggactcg 23880 
atacgccgcc tcatccgctt ttttgggggc gcccggggag gcggcggcga cggggacggg 23940 
gacgacacgt cctccatggt tgggggacgt cgcgccgcac cgcgtccgcg ctcgggggtg 24000 
gtttcgcgct gctcctcttc ccgactggcc atttccttct cctataggca gaaaaagatc 24060 
atggagtcag tcgagaagaa ggacagccta accgccccct ctgagttcgc caccaccgcc 24120 
tccaccgatg ccgccaacgc gcctaccacc ttccccgtcg aggcaccccc gcttgaggag 24180 
gaggaagtga ttatcgagca ggacccaggt tttgtaagcg aagacgacga ggaccgctca 24240 
gtaccaacag aggataaaaa gcaagaccag gacaacgcag aggcaaacga ggaacaagtc 24300 
gggcgggggg acgaaaggca tggcgactac ctagatgtgg gagacgacgt gctgttgaag 24360 
catctgcagc gccagtgcgc cattatctgc gacgcgttgc aagagcgcag cgatgtgccc 24420 
ctcgccatag cggatgtcag ccttgcctac gaacgccacc tattctcacc gcgcgtaccc 24480 
cccaaacgcc aagaaaacgg cacatgcgag cccaacccgc gcctcaactt ctaccccgta 24540 
tttgccgtgc cagaggtgct tgccacctat cacatctttt tccaaaactg caagataccc 24600 
ctatcctgcc gtgccaaccg cagccgagcg gacaagcagc tggccttgcg gcagggcgct 24660 
gtcatacctg atatcgcctc gctcaacgaa gtgccaaaaa tctttgaggg tcttggacgc 24720 
gacgagaagc gcgcggcaaa cgctctgcaa caggaaaaca gcgaaaatga aagtcactct 24780 
ggagtgttgg tggaactcga gggtgacaac gcgcgcctag ccgtactaaa acgcagcatc 24840 
gaggtcaccc actttgccta cccggcactt aacctacccc ccaaggtcat gagcacagtc 24 900 
atgagtgagc tgatcgtgcg ccgtgcgcag cccctggaga gggatgcaaa tttgcaagaa 24960 
caaacagagg agggcctacc cgcagttggc gacgagcagc tagcgcgctg gcttcaaacg 25020 
cgcgagcctg ccgacttgga ggagcgacgc aaactaatga tggccgcagt gctcgttacc 25080 
gtggagcttg agtgcatgca gcggttcttt gctgacccgg agatgcagcg caagctagag 25140 
gaaacattgc actacacctt tcgacagggc tacgtacgcc aggcctgcaa gatctccaac 25200 
gtggagctct gcaacctggt ctcctacctt ggaattttgc acgaaaaccg ccttgggcaa 25260 
aacgtgcttc attccacgct caagggcgag gcgcgccgcg actacgtccg cgactgcgtt 25320 
tacttatttc tatgctacac ctggcagacg gccatgggcg tttggcagca gtgcttggag 25380 
gagtgcaacc tcaaggagct gcagaaactg ctaaagcaaa acttgaagga cctatggacg 25440 
gccttcaacg agcgctccgt ggccgcgcac ctggcggaca tcattttccc cgaacgcctg 25500 
cttaaaaccc tgcaacaggg tctgccagac ttcaccagtc aaagcatgtt gcagaacttt 25560 
aggaacttta tcctagagcg ctcaggaatc ttgcccgcca cctgctgtgc acttcctagc 25620 
gactttgtgc ccattaagta ccgcgaatgc cctccgccgc tttggggcca ctgctacctt 25680 
ctgcagctag ccaactacct tgcctaccac tctgacataa tggaagacgt gagcggtgac 25740 
ggtctactgg agtgtcactg tcgctgcaac ctatgcaccc cgcaccgctc cctggtttgc 25800 
aattcgcagc tgcttaacga aagtcaaatt atcggtacct ttgagctgca gggtccctcg 25860' 
cctgacgaaa agtccgcggc tccggggttg aaactcactc cggggctgtg gacgtcggct 25920 
taccttcgca aatttgtacc tgaggactac cacgcccacg agattaggtt ctacgaagac 25980 
caatcccgcc cgccaaatgc ggagcttacc gcctgcgtca ttacccaggg ccacattctt 26040 
ggccaattgc aagccatcaa caaagcccgc caagagtttc tgctacgaaa gggacggggg 26100 
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gtttacttgg acccccagtc cggcgaggag ctcaacccaa 
tatcagcagc agccgcgggc ccttgcttcc caggatggca 
gccgccgcca cccacggacg aggaggaata ctgggacagt 
cgaggaggag gaggacatga tggaagactg ggagagccta 
cgaagaggtg tcagacgaaa caccgtcacc ctcggtcgca 
gaaatcggca accggttcca gcatggctac aacctccgct 
gcccgttcgc cgacccaacc gtagatggga caccactgga 
gcagccgccg ccgttagccc aagagcaaca acagcgccaa 
gcacaagaac gccatagttg cttgcttgca agactgtggg 
ccgctttctt ctctaccatc acggcgtggc cttcccccgt 
tcatctctac agcccatact gcaccggcgg cagcggcagc 
cacagaagca aaggcgaccg gatagcaaga ctctgacaaa 
cggcagcagc aggaggagga gcgctgcgtc tggcgcccaa 
agcttagaaa caggattttt cccactctgt atgctatatt 
aacaagagct gaaaataaaa aacaggtctc tgcgatccct 
acaaaagcga agatcagctt cggcgcacgc tggaagacgc 
actgcgcgct gactcttaag gactagtttc gcgccctttc 
tacgtcatct ccagcggcca cacccggcgc cagcacctgt 
aggaaattcc cacgccctac atgtggagtt accagccaca 
ctgcccaaga ctactcaacc cgaataaact acatgagcgc 
gggtcaacgg aatccgcgcc caccgaaacc gaattctctt 
ccacacctcg taataacctt aatccccgta gttggcccgc 
gtcccgctcc caccactgtg gtacttccca gagacgccca 
actcaggggc gcagcttgcg ggcggctttc gtcacagggt 
taactcacct gacaatcaga gggcgaggta ttcagctcaa 
cgcttggtct ccgtccggac gggacatttc agatcggcgg 
cgcctcgtca ggcaatccta actctgcaga cctcgtcctc 
ttggaactct gcaatttatt gaggagtttg tgccatcggt 
gacctcccgg ccactatccg gatcaattta ttcctaactt 
cggacggcta cgactgaatg ttaagtggag aggcagagca 
tccactgtcg ccgccacaag tgctttgccc gcgactccgg 
tgcccgagga tcatatcgag ggcccggcgc acggcgtccg 
ttgcccgtag cctgattcgg gagtttaccc agcgccccct 
gaccctgtgt tctcactgtg atttgcaact gtcctaacct 
gttgccatct ctgtgctgag tataataaat acagaaatta 
cgccatcctg taaacgccac cgtcttcacc cgcccaagca 
ggtactttta acatctctcc ctctgtgatt tacaacagtt 
ctacgagaga acctctccga gctcagctac tccatcagaa 
tgccgggaac gtacgagtgc gtcaccggcc gctgcaccac 
ccagactttt tccggacaga cctcaataac tctgtttacc 
aaaaccctta gggtattagg ccaaaggcgc agctactgtg 
caactctacg ggctattcta attcaggttt ctctagaagt 
atctgacttt ggccagcacc tgtcccgcgg atttgttcca 
cctaacagag atgaccaaca caaccaacgc ggccgccgct 
aaatacaccc caagtttctg cctttgtcaa taactgggat 
ctccatagcg cttatgtttg tatgccttat tattatgtgg 
caaacgcgcc cgaccaccca tctatagtcc catcattgtg 
aatccataga ttggacggac tgaaacacat 'gttcttttct 
gatctagaaa tggacggaat tattacagag cagcgcctgc 
gccgagcaac agcgcatgaa tcaagagctc caagacatgg 
aggggtatct tttgtctggt aaagcaggcc aaagtcacct 
caccgcctta gctacaagtt gccaaccaag cgtcagaaat 
aagcccatta ccataactca gcactcggta gaaaccgaag 
caaggacctg aggatctctg cacccttatt aagaccctgt 
ccctttaact aataaaaaaa aataataaag catcacttac 
tctgtccagt ttattcagca gcacctcctt gccctcctcc 
cctcctggct gcaaactttc tccacaatct aaatggaatg 
tccatccgca cccactatct tcatgttgtt gcagatgaag 
taccttcaac cccgtgtatc catatgacac ggaaaccggt 
tactcctccc tttgtatccc ccaatgggtt tcaagagagt 
gcgcctatcc gaacctctag ttacctccaa tggcatgctt 
cctctctctg gacgaggccg gcaaccttac ctcccaaaat 
tctcaaaaaa accaagtcaa acataaacct ggaaatatct 
agaagcccta actgtggctg ccgccgcacc tctaatggtc 
gcaatcacag gccccgctaa ccgtgcacga ctccaaactt 
cctcacagtg tcagaaggaa agctagccct gcaaacatca 



tccccccgcc gccgcagccc 26160 
cccaaaaaga agctgcagct 26220 
caggcagagg aggttttgga 26280 
gacgaggaag cttccgaggt 26340 
ttcccctcgc cggcgcccca 26400 
cctcaggcgc cgccggcact 264 60 
accagggccg gtaagtccaa 26520 
ggctaccgct catggcgcgg 26580 
ggcaacatct ccttcgcccg 26640 
aaqatcctgc attactaccg 26700 
ggcagcaaca gcagcggcca 26760 
gcccaagaaa tccacagcgg 26820 
cgaacccgta tcgacccgcg 26880 
tcaacagagc aggggccaag 26940 
cacccgcagc tgcctgtatc 27000 
ggaggctctc ttcagtaaat 27060 
tcaaatttaa gcgcgaaaac 27120 
cgtcagcgcc attatgagca 27180 
aatgggactt gcggctggag 27240 
gggaccccac atgatatccc 27300 
ggaacaggcg gctattacca 27360 
tgccctggtg taccaggaaa 27420 
ggccgaagtt cagatgacta 27480 
gcggtcgccc gggcagggta 27540 
cgacgagtcg gtgagctcct 27600 
cgccggccgt ccttcattca 27660 
tgagccgcgc tctggaggca 27720 
ctactttaac cccttctcgg 27780 
tgacgcggta aaggactcgg 27840 
actgcgcctg aaacacctgg 27900 
tgagttttgc tactttgaat 27960 
gcttaccgcc cagggagagc 28020 
gctagttgag cgggacaggg 28080 
tggattacat caagatcttt 28140 
aaatatactg gggctcctat 28200 
aaccaaggcg aaccttacct 28260 
tcaacccaga cggagtgagt 28320 
aaaacaccac cctccttacc 28380 
acctaccgcc tgaccgtaaa 28440 
agaacaggag gtgagcttag 28500 
gggtttatga acaattcaag 2B560 
caggcttcct ggatgtcagc 28620 
gtccaactac agcgacccac 28680 
accggactta catctaccac 28740 
aacttgggca tgtggtggtt 28800 
ctcatctgct gcctaaagcg 28860 
ctacacccaa acaatgatgg 28920 
cttacagtat gattaaatga 28980 
tagaaagacg cagggcagcg 29040 
ttaacttgca ccagtgcaaa 29100 
acgacagtaa taccaccgga 29160 
tggtggtcat ggtgggagaa 29220 
gctgcattca ctcaccttgt 29280 
gcggtctcaa agatcttatt 29340 
ttaaaatcag ttagcaaatt 29400 
cagctctggt attgcagctt 29460 
tcagtttcct cctgttcctg 29520 
cgcgcaagac cgtctgaaga 29580 
cctccaactg tgccttttct 29640 
ccccctgggg tactctcttt 29700 
gcgctcaaaa tgggcaacgg 29760 
gtaaccactg tgagcccacc 29820 
gcacccctca cagttacctc 29880 
gcgggcaaca cactcaccat 29940 
agcattgcca cccaaggacc 30000 
ggccccctca ccaccaccga 30060 
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tagcagtacc cttactatca ctgcctcacc ccctctaact actgccactg gtagcttggg 30120 
cattgacttg aaagagccca tttatacaca aaatggaaaa ctaggactaa agtacggggc 30180 
tcctttgcat gtaacagacg acctaaacac tttgaccgta gcaactggtc caggtgtgac 30240 
tattaataat acttccttgc aaactaaagt tactggagcc ttgggttttg attcacaagg 30300 
caatatgcaa cttaatgtag caggaggact aaggattgat tctcaaaaca gacgccttat 30360 
acttgatgtt agttatccgt ttgatgctca aaaccaacta aatctaagac taggacaggg 30420 
ccctcttttt ataaactcag cccacaactt ggatattaac tacaacaaag gcctttactt 30480 
gtttacagct tcaaacaatt ccaaaaagct tgaggttaac ctaagcactg ccaaggggtt 30540 
gatgtttgac gctacagcca tagccattaa tgcaggagat gggcttgaat ttggttcacc 30600 
taatgcacca aacacaaatc ccctcaaaac aaaaattggc catggcctag aatttgattc 30660 
aaacaaggct atggttccta aactaggaac tggccttagt tttgacagca caggtgccat 30720 
tacagtagga aacaaaaata atgataagct aactttgtgg accacaccag ctccatctcc 30780 
taactgtaga ctaaatgcag agaaagatgc taaactcact ttggtcttaa caaaatgtgg 30840 
cagtcaaata cttgctacag tttcagtttt ggctgttaaa ggcagtttgg ctccaatatc 30900 
tggaacagtt caaagtgctc atcttattat aagatttgac gaaaatggag tgctactaaa 30960 
caattccttc ctggacccag aatattggaa ctttagaaat ggagatctta ctgaaggcac 31020 
agcctataca aacgctgttg gatttatgcc taacctatca gcttatccaa aatctcacgg 31080 
taaaactgcc aaaagtaaca ttgtcagtca agtttactta aacggagaca aaactaaacc 31140 
tgtaacacta accattacac taaacggtac acaggaaaca ggagacacaa ctccaagtgc 31200 
atactctatg tcattttcat gggactggtc tggccacaac tacattaatg aaatatttgc 31260 
cacatcctct tacacttttt catacattgc ccaagaataa agaatcgttt gtgttatgtt 31320 
tcaacgtgtt tatttttcaa ttgcagaaaa tttcaagtca tttttcattc agtagtatag 31380 
ccccaccacc acatagctta tacagatcac cgtaccttaa tcaaactcac agaaccctag 31440 
tattcaacct gccacctccc tcccaacaca cagagtacac agtcctttct ccccggctgg 31500 
ccttaaaaag catcatatca tgggtaacag acatattctt aggtgttata ttccacacgg 31560 
tttcctgtcg agccaaacgc tcatcagtga tattaataaa ctccccgggc agctcactta 31620 
agttcatgtc gctgtccagc tgctgagcca caggctgctg tccaacttgc ggttgcttaa 31680 
cgggcggcga aggagaagtc cacgcctaca tgggggtaga gtcataatcg tgcatcagga 31740 
tagggcggtg gtgctgcagc agcgcgcgaa taaactgctg ccgccgccgc tccgtcctgc 31800 
aggaatacaa catggcagtg gtctcctcag cgatgattcg caccgcccgc agcataaggc 31860 
gccttgtcct ccgggcacag cagcgcaccc tgatctcact taaatcagca cagtaactgc 31920 
agcacagcac cacaatattg ttcaaaatcc cacagtgcaa ggcgctgtat ccaaagctca 31980 
tggcggggac cacagaaccc acgtggccat cataccacaa gcgcaggtag attaagtggc 32040 
gacccctcat aaacacgctg gacataaaca ttacctcttt tggcatgttg taattcacca 32100 
cctcccggta ccatataaac ctctgattaa acatggcgcc atccaccacc atcctaaacc 32160 
agctggccaa aacctgcccg ccggctatac actgcaggga accgggactg gaacaatgac 32220 
agtggagagc ccaggactcg taaccatgga tcatcatgct cgtcatgata tcaatgttgg 32280 
cacaacacag gcacacgtgc atacacttcc tcaggattac aagctcctcc cgcgttagaa 32340 
ccatatccca gggaacaacc cattcctgaa tcagcgtaaa tcccacactg cagggaagac 324 00 
ctcgcacgta actcacgttg tgcattgtca aagtgttaca ttcgggcagc agcggatgat 324 60 
cctccagtat ggtagcgcgg gtttctgtct caaaaggagg tagacgatcc ctactgtacg 32520 
gagtgcgccg agacaaccga gatcgtgttg gtcgtagtgt catgccaaat ggaacgccgg 32580 
acgtagtcat atttcctgaa gcaaaaccag gtgcgggcgt gacaaacaga tctgcgtctc 32640 
cggtctcgcc gcttagatcg ctctgtgtag tagttgtagt atatccactc tctcaaagca 32700 
tccaggcgcc ccctggcttc gggttctatg taaactcctt catgcgccgc tgccctgata 32760 
acatccacca ccgcagaata agccacaccc agccaaccta cacattcgtt ctgcgagtca 32820 
cacacgggag gagcgggaag agctggaaga accatgtttt tttttttatt ccaaaagatt 32880 
atccaaaacc tcaaaatgaa gatctattaa gtgaacgcgc tcccctccgg tggcgtggtc 32940 
aaactctaca gccaaagaac agataatggc atttgtaaga tgttgcacaa tggcttccaa 33000 
aaggcaaacg gccctcacgt ccaagtggac gtaaaggcta aacccttcag ggtgaatctc 33060 
ctctataaac attccagcac cttcaaccat gcccaaataa ttctcatctc gccaccttct 33120 
caatatatct ctaagcaaat cccgaatatt aagtccggcc attgtaaaaa tctgctccag 33180 
agcgccctcc accttcagcc tcaagcagcg aatcatgatt gcaaaaattc aggttcctca 33240 
cagacctgta taagattcaa aagcggaaca ttaacaaaaa taccgcgatc ccgtaggtcc 33300 
cttcgcaggg ccagctgaac ataatcgtgc aggtctgcac ggaccagcgc ggccacttcc 33360 
ccgccaggaa ccttgacaaa agaacccaca ctgattatga cacgcatact cggagctatg 33420 
ctaaccagcg tagccccgat gtaagctttg ttgcatgggc ggcgatataa aatgcaaggt 33480 
gctgctcaaa aaatcaggca aagcctcgcg caaaaaagaa agcacatcgt agtcatgctc 33540 
atgcagataa aggcaggtaa gctccggaac caccacagaa aaagacacca tttttctctc 33600 
aaacatgtct gcgggtttct gcataaacac aaaataaaat aacaaaaaaa catttaaaca 33660 
ttagaagcct gtcttacaac aggaaaaaca acccttataa gcataagacg gactacggcc 33720 
atgccggcgt gaccgtaaaa aaactggtca ccgtgattaa aaagcaccac cgacagctcc 33780 
tcggtcatgt ccggagtcat aatgtaagac tcggtaaaca catcaggttg attcatcggt 33840 
cagtgctaaa aagcgaccga aatagcccgg gggaatacat acccgcaggc gtagagacaa 33900 
cattacagcc cccataggag gtataacaaa attaatagga gagaaaaaca cataaacacc 33960 
tgaaaaaccc tcctgcctag gcaaaatagc accctcccgc tccagaacaa catacagcgc 34020 
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ttcacagcgg cagcctaaca gtcagcctta ccagtaaaaa agaaaaccta ttaaaaaaac 34080 

accactcgac acggcaccag ctcaatcagt cacagtgtaa aaaagggcca agtgcagagc 34140 

gagtatatat aggactaaaa aatgacgtaa cggttaaagt ccacaaaaaa cacccagaaa 34200 

accgcacgcg aacctacgcc cagaaacgaa agccaaaaaa cccacaactt cctcaaatcg 34260 

tcacttccgt tttcccacgt tacgtaactt cccattttaa gaaaactaca attcccaaca 34320 

catacaagtt actccgccct aaaacctacg tcacccgccc cgttcccacg ccccgcgcca 34380 

cgtcacaaac tccaccccct cattatcata ttggcttcaa tccaaaataa ggtatattat 34440 

tgatgatg 34448 

<210> 5 
<211> 94 
<212> PRT 

<213> Adenovirus subgroup C 
<400> 5 

Met Val Asp Thr Val Asn Ser Tyr Asn Thr Ala Thr Gly Leu Thr Ser 
1 5 10 15 

Ala Leu Asn Leu Pro Gin Val Ser Thr Phe Val Asn Asn Trp Ala Asn 
20 25 30 

Leu Gly Met Trp Trp Phe Ser lie Ala Leu Met Phe Val Cys Leu lie 
35 40 45 

He Met Trp Leu Ser Cys Cys Leu Lys Arg Lys Arg Ala Arg Pro Pro 
50 55 60 

He Tyr Lys Pro He He Val Leu Asn Pro Asn Asn Asp Gly He His 
65 70 75 80 

Arg Leu Asp Gly Leu Asn Thr Cys Ser Phe Ser Phe Ala Val 
85 90 



<210> 6 
<211> 101 
<212> PRT 

<213> Adenovirus subgroup C 
<400> 6 

Met Thr Gly Ser Thr He Ala Pro Thr Thr Asp Tyr Arg Asn Thr Thr 
15 10 15 

Ala Thr Gly Leu Thr Ser Ala Leu Asn Leu Pro Gin Val His Ala Phe 
20 25 30 

Val Asn Asp Trp Ala Ser Leu Asp Met Trp Trp Phe Ser He Ala Leu 
35 40 45 

Met Phe Val Cys Leu He He Met Trp Leu He Cys Cys Leu Lys Arg 
50 55 60 

Arg Arg Ala Arg Pro Pro He Tyr Arg Pro He He Val Leu Asn Pro 
65 70 75 80 

His Asn Glu Lys He His Arg Leu Asp Gly Leu Lys Pro Cys Ser Leu 
85 90 95 

Leu Leu Gin Tyr Asp 
100 



<210> 7 
<211> 93 
<212> PRT 

<213> Adenovirus subgroup C 
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<400> 7 

Met Thr Asn Thr Thr Asn Ala Ala Ala Ala Thr Gly Leu Thr Ser Thr 
1-5 10 15 

Thr Asn Thr Pro Gin Val Ser Ala Phe Val Asn Asn Trp Asp Asn Leu 
20 25 30 

Gly Met Trp Trp Phe Ser lie Ala Leu Met Phe Val Cys Leu lie lie 
35 40 45 

Met Trp Leu lie Cys Cys Leu Lys Arg Lys Arg Ala Arg Pro Pro lie 
50 55 60 

Tyr Ser Pro lie lie Val Leu His Pro Asn Asn Asp Gly lie His Arg 
65 70 75 80 

Leu Asp Gly Leu Lys His Met Phe Phe Ser Leu Thr Val 
85 90 



<210> 8 
<211> 95 
<212> PRT 

<213> Adenovirus subgroup C 
<400> 8 

Met Val Asp Thr Val Asn Ser Tyr Asn Thr Ala Thr Gly Leu Lys Ser 
1 5 10 15 

Ala Leu Asn Leu Pro Gin Val His Ala Phe Val Asn Asp Trp Ala Ser 
20 25 30 

Leu Gly Met Trp Trp Phe Ser He Ala Leu Met Phe Val Cys Leu He 
35 40 45 

He Met Trp Leu He Cys Cys Leu Lys Arg Arg Arg Ala Arg Pro Pro 
50 55 60 

He Tyr Arg Pro He He Val Leu Asn Pro His Asn Glu Lys He His 
65 70 75 80 

Arg Leu Asp Gly Leu Lys Pro Cys Ser Leu Leu Leu Gin Tyr Asp 
85 90 95 



<210> 9 
<211> 78 
<212> PRT 

<213> Adenovirus subgroup C 
<400> 9 

Met Thr Gly Ser Thr He Ala Pro Thr Thr Asp Tyr Arg Asn Thr Thr 
1 5 10 15 

Ala Thr Gly Leu Thr Ser Ala Leu Asn Leu Pro Gin Val His Ala Phe 
20 25 30 

Val Asn Asp Trp Ala Ser Leu Asp Met Trp Trp Phe Ser He Ala Leu 
35 40 45 

Met Phe Val Cys Leu He He Met Trp Leu He Cys Cys Leu Lys Arg 
50 55 60 

Arg Arg Ala Arg Pro Pro He Tyr Arg Pro He He Val Leu 
65 70 75 
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<210> 10 
<211> 87 
<212> PRT 

<213> Adenovirus subgroup C 
<400> 10 

Met Thr Gly Ser Thr lie Ala Pro Thr Thr Asp Tyr Arg Asn Thr Thr 
1 5 10 15 

Ala Thr Gly Leu Thr Ser Ala Leu Asn Leu Pro Gin Val His Ala Phe 
20 25 30 

Val Asn Asp Trp Ala Ser Leu Asp Met Trp Trp Phe Ser He Ala Leu 
35 40 45 

Met Phe Val Cys Leu lie He Met Trp Leu He Cys Cys Leu Lys Arg 
50 55 60 

Arg Arg Ala Arg Pro Pro He Tyr Arg Pro He Gly Leu Lys Pro Cys 
65 70 75 80 

Ser Leu Leu Leu Gin Tyr Asp 
85 



<210> 11 
<211> 77 
<212> PRT 

<213> Adenovirus subgroup C 
<400> 11 

Met Thr Gly Ser Thr He Ala Pro Thr Thr Asp Tyr Arg Asn Thr Thr 
1 5 10 15 

Ala Thr Gly Leu Thr Ser Ala Leu Asn Leu Pro Gin Val His Ala Phe 
20 25 30 

Val Asn Asp Trp Ala Ser Leu Asp Met Trp Trp Phe Ser He Ala Leu 
35 40 45 

Met Phe Val Cys Leu He He Met Trp Leu He Cys Cys Leu Lys Arg 
50 55 60 

Arg Arg Ala Arg Pro Pro Ser Leu Leu Leu Gin Tyr Asp 
65 70 75 



<210> 12 
<211> 84 
<212> PRT 

<213> Adenovirus subgroup C 
<400> 12 

Met Thr Gly Ser Thr He Ala Pro Thr Thr Asp Tyr Arg Asn Thr Thr 
15 10 15 

Ala Thr Gly Leu Thr Ser Ala Leu Asn Leu Pro Gin He Ala Leu Met 
20 25 30 

Phe Val Cys Leu He He Met Trp Leu He Cys Cys Leu Lys Arg Arg 
35 40 45 

Arg Ala Arg Pro Pro lie Tyr Arg Pro He He Val Leu Asn Pro His 
50 55 60 
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Asn Glu Lys lie His Arg Leu Asp Gly Leu Lys Pro Cys Ser Leu Leu 
65 70 75 80 

Leu Gin Tyr Asp 



<210> 13 
<211> 35724 
<212> DNA 

<213> Adenovirus subgroup C 



<400> 13 

catcatcaat 

ttgtgacgtg 

gatgttgcaa 

gtgtgcgccg 

taaatttggg 

agtgaaatct 

gactttgacc 

cgggtcaaag 

tgagttcctc 

tccgacaccg 

ccattttgaa 

tcccaacgag 

agggattgac 

ccggcagccc 

tccacccagt 

ccccgggcac 

tatgtgttcg 

atgggcagtg 

gttttgtggt 

gagcctgagc 

cctgctatcc 

agctgtgact 

cccattaaac 

gacttgctta 

ggtgtaaacc 

agtttaataa 

aaagggtata 

gagtgtttgg 

tcttggtttt 

gaggattaca 

ttgaatctgg 

acaccggggc 

gaagaaaccc 

gcggttgtga 

ccgacggagg 

ccatggaacc 

tgtatccaga 

taaagaggga 

taatgaccag 

atgagcttga 

agccagggga 

attgcaagta 

acggggccga 

atatgtggcc 

gccccaattt 

gcttctatgg 

gtgcctttta 

agaaatgcct 

gccacaatgt 

agcataacat 

acggcaactg 

cagtgtttga 



aatatacctt 
gcgcggggcg 
gtgtggcgga 
gtgtacacag 
cgtaaccgag 
gaataatttt 
gtttacgtgg 
ttggcgtttt 
aagaggccac 
ggactgaaaa 
ccacctaccc 
gaggcggttt 
ttactcactt 
gagcagccgg 
gacgacgagg 
ggttgcaggt 
ctttgctata 
ggtgatagag 
ttaaagaatt 
ccgagccaga 
tgagacgccc 
ccggtccttc 
cagttgccgt 
acgagcctgg 
tgtgattgcg 
agggtgagat 
taatgcgccg 
aagatttttc 
ggaggtttct 
agtgggaatt 
gtcaccaggc 
gcgctgcggc 
atctgagcgg 
gacacaagaa 
agcagcagca 
cgagagccgg 
actgagacgc 
gcggggggct 
acaccgtcct 
tctgctggcg 
tgattttgag 
caagatcagc 
ggtggagata 
gggggtgctt 
tagcggtacg 
gtttaacaat 
ctgctgctgg 
ctttgaaagg 
ggcctccgac 
ggtatgtggc 
tcacctgctg 
gcataacata 



attttggatt 
tgggaacggg 
acacatgtaa 
gaagtgacaa 
taagatttgg 
gtgttactca 
agactcgccc 
attattatag 
tcttgagtgc 
tgagacatga 
ttcacgaact 
cgcagatttt 
ttccgccggc 
agcagagagc 
atgaagaggg 
cttgtcatta 
tgaggacctg 
tggtgggttt 
ttgtattgtg 
accggagcct 
gacatcacct 
taacacacct 
gagagttggt 
gcaacctttg 
tgtgtggtta 
aatgtttaac 
tgggctaatc 
tgctgtgcgt 
gtggggctca 
tgaagagctt 
gcttttccaa 
tgctgttgct 
ggggtacctg 
tcgcctgcta 
gcagcaggag 
cctggaccct 
attttgacaa 
tgtgaggcta 
gagtgtatta 
cagaagtatt 
gaggctatta 
aaacttgtaa 
gatacggagg 
ggcatggacg 
gttttcctgg 
acctgtgtgg 
aagggggtgg 
tgtaccttgg 
tgtggttgct 
aactgcgagg 
aagaccattc 
ctgacccgct 



gaagccaata 
gcgggtgacg 
gcgacggatg 
ttttcgcgcg 
ccattttcgc 
tagcgcgtaa 
aggtgttttt 
tcagctgacg 
cagcgagtag 
ggtactggct 
gtatgattta 
tcccgactct 
gcccggttct 
cttgggtccg 
tgaggagttt 
tcaccggagg 
tggcatgttt 
ggtgtggtaa 
atttttttaa 
gcaagaccta 
gtgtctagag 
cctgagatac 
gggcgtcgcc 
gacttgagct 
acgcctttgt 
ttgcatggcg 
ttggttacat 
aacttgctgg 
tcccaggcaa 
ttgaaatcct 
gagaaggtca 
tttttgagtt 
ctggattttc 
ctgttgtctt 
gaagccaggc 
cgggaatgaa 
ttacagagga 
cagaggaggc 
cttttcaaca 
ccatagagca 
gggtatatgc 
atatcaggaa 
atagggtggc 
gggtggttat 
ccaataccaa 
aagcctggac 
tgtgtcgccc 
gtatcctgtc 
tcatgctagt 
acagggcctc 
acgtagccag 
gttccttgca 



tgataatgag 

tagtagtgtg 

tggcaaaagt 

gttttaggcg 

gggaaaactg 

tatttgtcta 

ctcaggtgtt 

tgtagtgtat 

agttttctcc 

gataatcttc 

gacgtgacgg 

gtaatgttgg 

ccggagccgc 

gtttgccacg- 

gtgttagatt 

aatacggggg 

gtctacagta 

tttttttttt 

aaggtcctgt 

cccgccgtcc 

aatgcaatag 

acccggtggt 

aggctgtgga 

gtaaacgccc 

ttgctgaatg 

tgttaaatgg 

ctgacctcat 

aacagagctc 

agttagtctg 

gtggtgagct 

tcaagacttt 

ttataaagga 

tggccatgca 

ccgtccgccc 

ggcggcggca 

tgttgtacag 

tgggcagggg 

taggaatcta 

gatcaaggat 

gctgaccact 

aaaggtggca 

ttgttgctac 

ctttagatgt 

tatgaatgta 

ccttatccta 

cgatgtaagg 

caaaagcagg 

tgagggtaac 

gaaaagcgtg 

tcagatgctg 

ccactctcgc 

tttgggtaac 



ggggtggagt 
gcggaagtgt 
gacgtttttg 
gatgttgtag 
aataagagga 
gggccgcggg 
ttccgcgttc 
ttatacccgg 
tccgagccgc 
cacctcctag 
cccccgaaga 
cggtgcagga 
ctcacctttc 
aggctggctt 
atgtggagca 
acccagatat 
agtgaaaatt 
aatttttaca 
gtctgaacct 
taaaatggcg 
tagtacggat 
cccgctgtgc 
atgtatcgag 
caggccataa 
agttgatgta 
ggcggggctt 
ggaggcttgg 
taacagtacc 
cagaattaag 
gtttgattct 
ggatttttcc 
taaatggagc 
tctgtggaga 
ggcgataata 
ggagcagagc 
gtggctgaac 
ctaaaggggg 
gcttttagct 
aattgcgcta 
tactggctgc 
cttaggccag 
atttctggga 
agcatgataa 
aggtttactg 
cacggtgtaa 
gttcggggct 
gcttcaatta 
tccagggtgc 
gctgtgatta 
acctgctcgg 
aaggcctggc 
aggagggggg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 
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tgttcctacc ttaccaatgc aatttgagtc 
tgtccaaggt gaacctgaac ggggtgtttg 
ggtacgatga gacccgcacc aggtgcagac 
accagcctgt gatgctggat gtgaccgagg 
gcacccgcgc tgagtttggc tctagcgatg 
ggcgtggctt aagggtggga aagaatatat 
gttttgcagc agccgccgcc gccatgagca 
catatttgac aacgcgcatg cccccatggg 
gcattgatgg tcgccccgtc ctgcccgcaa 
ctggaacgcc gttggagact gcagcctccg 
gcgggattgt gactgacttt gctttcctga 
catccgcccg cgatgacaag ttgacggctc 
aacttaatgt cgtttctcag cagctgttgg 
cttcctcccc tcccaatgcg gtttaaaaca 
ggatcaagca agtgtcttgc tgtctttatt 
accagcggtc tcggtcgttg agggtcctgt 
tctggatgtt cagatacatg ggcataagcc 
gagcttcatg ctgcggggtg gtgttgtaga 
ggtgcctaaa aatgtctttc agtagcaagc 
tgtttacaaa gcggttaagc tgggatgggt 
actgtatttt taggttggct atgttcccag 
gaaccaccag cacagtgtat ccggtgcact 
atgcgtggaa gaacttggag acgcccttgt 
taatgatggc aatgggccca cgggcggcgg 
cgtcatagtt gtgttccagg atgagatcgt 
gggtgccaga ctgcggtata atggttccat 
tttgcatttc ccacgctttg agttcagatg 
agaaaacggt ttccggggta ggggagatca 
gcgacttacc gcagccggtg ggcccgtaaa 
taagagagct gcagctgccg tcatccctga 
tgactcgcat gttttccctg accaaatccg 
gttcttgcaa ggaagcaaag tttttcaacg 
tgagcgtttg accaagcagt tccaggcggt 
ctcgatccag catatctcct cgtttcgcgg 
tcggtgctcg tccagacggg ccagggtcat 
cgtagtctgg gtcacggtga aggggtgcgc 
gaggctggtc ctgctggtgc tgaagcgctg 
gcatttgacc atggtgtcat agtccagccc 
gcccttggag gaggcgccgc acgaggggca 
cgcgagaaat accgattccg gggagtaggc 
gcattccacg agccaggtga gctctggccg 
ctttttgatg cgtttcttac ctctggtttc 
aaggctgtcc gtgtccccgt atacagactt 
gtcctcctcg tatagaaact cggaccactc 
gaaggaggct aagtgggagg ggtagcggtc 
ggtgtgaaga cacatgtcgc cctcttcggc 
ggccacgtga ccgggtgttc ctgaaggggg 
ctcactctct tccgcatcgc tgtctgcgag 
aaaagcgggc atgacttctg cgctaagatt 
attcacctgg cccgcggtga tgcctttgag 
aatctttttg ttgtcaagct tggtggcaaa 
ggcgatggag cgcagggttt ggtttttgtc 
tagctgcacg tattcgcgcg caacgcaccg 
gggcaccagg tgcacgcgcc aaccgcggtt 
tacctctccg cgtaggcgct cgttggtcca 
tggcggtagg gggtctagct gcgtctcgtc 
gggcagcagg cgcgcgtcga agtagtctat 
ccatgcgcgg gcggcaagcg cgcgctcgta 
gtgggtgagc gcggaggcgt acatgccgca 
tattccaaga tatgtagggt agcatcttcc 
tagttcgtgc gagggagcga ggaggtcggg 
tcggaagact atctgcctga agatggcatg 
gacgttgaag ctggcgtctg tgagacctac 
gcgcagcttg ttgaccagct cggcggtgac 
ttccttgatg atgtcatact tatcctgtcc 
aaactcttcg cggtctttcc agtactcttg 



acactaagat attgcttgag cccgagagca 3180 

acatgaccat gaagatctgg aaggtgctga 3240 

cctgcgagtg tggcggtaaa catattagga 3300 

agctgaggcc cgatcacttg gtgctggcct 3360 

aagatacaga ttgaggtact gaaatgtgtg 3420 

aaggtggggg tcttatgtag ttttgtatct 3480 

ccaactcgtt tgatggaagc attgtgagct 3540 

ccggggtgcg tcagaatgtg atgggctcca 3600 

actctactac cttgacctac gagaccgtgt 3660 

ccgccgcttc agccgctgca gccaccgccc 3720 

gcccgcttgc aagcagtgca gcttcccgtt 3780 

ttttggcaca attggattct ttgacccggg 3840 

atctgcgcca gcaggtttct gccctgaagg 3900 

taaataaaaa accagactct gtttggattt 3960 

taggggtttt gcgcgcgcgg taggcccggg 4020 

gtattttttc caggacgtgg taaaggtgac 4080 

cgtctctggg gtggaggtag caccactgca 4140 

tgatccagtc gtagcaggag cgctgggcgt 4200 

tgattgccag gggcaggccc ttggtgtaag 4260 

gcatacgtgg ggatatgaga tgcatcttgg 4320 

ccatatccct ccggggattc atgttgtgca 4380 

tgggaaattt gtcatgtagc ttagaaggaa 4440 

gacctccaag attttccatg cattcgtcca 4500 

cctgggcgaa gatatttctg ggatcactaa 4560 

cataggccat ttttacaaag cgcgggcgga 4 620 

ccggcccagg ggcgtagtta ccctcacaga 4680 

gggggatcat gtctacctgc ggggcgatga 4740 

gctgggaaga aagcaggttc ctgagcagct 4800 

tcacacctat taccgggtgc aactggtagt 4860 

gcaggggggc cacttcgtta agcatgtccc 4920 

ccagaaggcg ctcgccgccc agcgatagca 4980 

gtttgagacc gtccgccgta ggcatgcttt 5040 

cccacagctc ggtcacctgc tctacggcat 5100 

gttggggcgg ctttcgctgt acggcagtag 5160 

gtctttccac gggcgcaggg tcctcgtcag 5220 

tccgggctgc gcgctggcca gggtgcgctt 5280 

ccggtcttcg ccctgcgcgt cggccaggta 5340 

ctccgcggcg tggcccttgg cgcgcagctt 5400 

gtgcagactt ttgagggcgt agagcttggg 54 60 

atccgcgccg caggccccgc agacggtctc 5520 

ttcggggtca aaaaccaggt ttcccccatg 5580 

catgagccgg tgtccacgct cggtgacgaa 5640 

gagaggcctg tcctcgagcg gtgttccgcg 5700 

tgagacaaag gctcgcgtcc aggccagcac 5760 

gttgtccact agggggtcca ctcgctccag 5820 

atcaaggaag gtgattggtt tgtaggtgta 5880 

gctataaaag ggggtggggg cgcgttcgtc 5940 

ggccagctgt tggggtgagt actccctctg 6000 

gtcagtttcc aaaaacgagg aggatttgat 6060 

ggtggccgca tccatctggt cagaaaagac 6120 

cgacccgtag agggcgttgg acagcaactt 6180 

gcgatcggcg cgctccttgg ccgcgatgtt 6240 

ccattcggga aagacggtgg tgcgctcgtc 6300 

gtgcagggtg acaaggtcaa cgctggtggc 6360 

gcagaggcgg ccgcccttgc gcgagcagaa 6420 

cggggggtct gcgtccacgg taaagacccc 6480 

cttgcatcct tgcaagtcta gcgcctgctg 6540 

tgggttgagt gggggacccc atggcatggg 6600 

aatgtcgtaa acgtagaggg gctctctgag 6660 

accgcggatg ctggcgcgca cgtaatcgta 6720 

accgaggttg ctacgggcgg gctgctctgc 6780 

tgagttggat gatatggttg gacgctggaa 6840 

cgcgtcacgc acgaaggagg cgtaggagtc 6900 

ctgcacgtct agggcgcagt agtccagggt 6960 

cttttttttc cacagctcgc ggttgaggac 7020 

gatcggaaac ccgtcggcct ccgaacggta 7080 
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agagcctagc atgtagaact ggttgacggc ctggtaggcg cagcatccct tttctacggg 7140 
tagcgcgtat gcctgcgcgg ccttccggag cgaggtgtgg gtgagcgcaa aggtgtccct 7200 
gaccatgact ttgaggtact ggtatttgaa gtcagtgtcg tcgcatccgc cctgctccca 7260 
gagcaaaaag tccgtgcgct ttttggaacg cggatttggc agggcgaagg tgacatcgtt 7320 
gaagagtatc tttcccgcgc gaggcataaa gttgcgtgtg atgcggaagg gtcccggcac 7380 
ctcggaacgg ttgttaatta cctgggcggc gagcacgatc tcgtcaaagc cgttgatgtt 74 40 
gtggcccaca atgtaaagtt ccaagaagcg cgggatgccc ttgatggaag gcaatttttt 7500 
aagttcctcg taggtgagct cttcagggga gctgagcccg tgctctgaaa gggcccagtc 7560 
tgcaagatga gggttggaag cgacgaatga gctccacagg tcacgggcca ttagcatttg 7620 
caggtggtcg cgaaaggtcc taaactggcg acctatggcc attttttctg gggtgatgca 7680 
gtagaaggta agcgggtctt gttcccagcg gtcccatcca aggttcgcgg ctaggtctcg 7740 
cgcggcagtc actagaggct catctccgcc gaacttcatg accagcatga agggcacgag 7800 
ctgcttccca aaggccccca tccaagtata ggtctctaca tcgtaggtga caaagagacg 7860 
ctcggtgcga ggatgcgagc cgatcgggaa gaactggatc tcccgccacc aattggagga 7920 
gtggctattg atgtggtgaa agtagaagtc cctgcgacgg gccgaacact cgtgctggct 7980 
tttgtaaaaa cgtgcgcagt actggcagcg gtgcacgggc tgtacatcct gcacgaggtt 8040 
gacctgacga ccgcgcacaa ggaagcagag tgggaatttg agcccctcgc ctggcgggtt 8100 
tggctggtgg tcttctactt cggctgcttg tccttgaccg tctggctgct cgaggggagt 8160 
tacggtggat cggaccacca cgccgcgcga gcccaaagtc cagatgtccg cgcgcggcgg 8220 
tcggagcttg atgacaacat cgcgcagatg ggagctgtcc atggtctgga gctcccgcgg 8280 
cgtcaggtca ggcgggagct cctgcaggtt tacctcgcat agacgggtca gggcgcgggc 8340 
tagatccagg tgatacctaa tttccagggg ctggttggtg gcggcgtcga tggcttgcaa 8400 
gaggccgcat ccccgcggcg cgactacggt accgcgcggc gggcggtggg ccgcgggggt 8460 
gtccttggat gatgcatcta aaagcggtga cgcgggcgag cccccggagg tagggggggc 8520 
tccggacccg ccgggagagg gggcaggggc acgtcggcgc cgcgcgcggg caggagctgg 8580 
tgctgcgcgc gtaggttgct ggcgaacgcg acgacgcggc ggttgatctc ctgaatctgg 8640 
cgcctctgcg tgaagacgac gggcccggtg agct'cgagcc tgaaagagag ttcgacagaa 8700 
tcaatttcgg tgtcgttgac ggcggcctgg cgcaaaatct cctgcacgtc tcctgagttg 8760 
tcttgatagg cgatctcggc catgaactgc tcgatctctt cctcctggag atctccgcgt 8820 
ccggctcgct ccacggtggc ggcgaggtcg ttggaaatgc gggccatgag ctgcgagaag 8880 
gcgttgaggc ctccctcgtt ccagacgcgg ctgtagacca cgcccccttc ggcatcgcgg 8940 
gcgcgcatga ccacctgcgc gagattgagc tccacgtgcc gggcgaagac ggcgtagttt 9000 
cgcaggcgct gaaagaggta gttgagggtg gtggcggtgt gttctgccac gaagaagtac 9060 
ataacccagc gtcgcaacgt ggattcgttg atatccccca aggcctcaag gcgctccatg 9120 
gcctcgtaga agtccacggc gaagttgaaa aactgggagt tgcgcgccga cacggttaac 9180 
tcctcctcca gaagacggat gagctcggcg acagtgtcgc gcacctcgcg ctcaaaggct 9240 
acaggggcct cttcttcttc ttcaatctcc tcttccataa gggcctcccc ttcttcttct 9300 
tctggcggcg gtgggggagg ggggacacgg cggcgacgac ggcgcaccgg gaggcggtcg 9360 
acaaagcgct cgatcatctc cccgcggcga cggcgcatgg tctcggtgac ggcgcggccg 9420 
ttctcgcggg ggcgcagttg gaagacgccg cccgtcatgt cccggttatg ggttggcggg 9480 
gggctgccat gcggcaggga tacggcgcta acgatgcatc tcaacaattg ttgtgtaggt 9540 
actccgccgc cgagggacct gagcgagtcc gcatcgaccg gatcggaaaa cctctcgaga 9600 
aaggcgtcta accagtcaca gtcgcaaggt aggctgagca ccgtggcggg cggcagcggg 9660 
cggcggtcgg ggttgtttct ggcggaggtg ctgctgatga tgtaattaaa gtaggcggtc 9720 
ttgagacggc ggatggtcga cagaagcacc atgtccttgg gtccggcctg ctgaatgcgc 9780 
aggcggtcgg ccatgcccca ggcttcgttt tgacatcggc gcaggtcttt gtagtagtct 9840 
tgcatgagcc tttctaccgg cacttcttct tctccttcct cttgtcctgc atctcttgca 9900 
tctatcgctg cggcggcggc ggagtttggc cgtaggtggc gccctcttcc tcccatgcgt 9960 
gtgaccccga agcccctcat cggctgaagc agggctaggt cggcgacaac gcgctcggct 10020 
aatatggcct gctgcacctg cgtgagggta gactggaagt catccatgtc cacaaagcgg 10080 
tggtatgcgc ccgtgttgat ggtgtaagtg cagttggcca taacggacca gttaacggtc 10140 
tggtgacccg gctgcgagag ctcggtgtac ctgagacgcg agtaagccct cgagtcaaat 10200 
acgtagtcgt tgcaagtccg caccaggtac tggtatccca ccaaaaagtg cggcggcggc 10260 
tggcggtaga ggggccagcg tagggtggcc ggggctccgg gggcgagatc ttccaacata 10320 
aggcgatgat atccgtagat gtacctggac atccaggtga tgccggcggc ggtggtggag 10380 
gcgcgcggaa agtcgcggac gcggttccag atgttgcgca gcggcaaaaa gtgctccatg 10440 
gtcgggacgc tctggccggt caggcgcgcg caatcgttga cgctctagcg tgcaaaagga 10500 
gagcctgtaa gcgggcactc ttccgtggtc tggtggataa attcgcaagg gtatcatggc 10560 
ggacgaccgg ggttcgagcc ccgtatccgg ccgtccgccg tgatccatgc ggttaccgcc 10620 
cgcgtgtcga acccaggtgt gcgacgtcag acaacggggg agtgctcctt ttggcttcct 10680 
tccaggcgcg gcggctgctg cgctagcttt tttggccact ggccgcgcgc agcgtaagcg 10740 
gttaggctgg aaagcgaaag cattaagtgg ctcgctccct gtagccggag ggttattttc 10800 
caagggttga gtcgcgggac ccccggttcg agtctcggac cggccggact gcggcgaacg 10860 
ggggtttgcc tccccgtcat gcaagacccc gcttgcaaat tcctccggaa acagggacga 10920 
gccccttttt tgcttttccc agatgcatcc ggtgctgcgg cagatgcgcc cccctcctca 10980 
gcagcggcaa gagcaagagc agcggcagac atgcagggca ccctcccctc ctcctaccgc 11040 
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gtcaggaggg gcgacatccg cggttgacgc 
gcgccgggcc cggcactacc tggacttgga 
gccctctcct gagcggtacc caagggtgca 
gccgcggcag aacctgtttc gcgaccgcga 
aaagttccac gcagggcgcg agctgcggca 
ggaggacttt gagcccgacg cgcgaaccgg 
cgccgacctg gtaaccgcat acgagcagac 
ctttaacaac cacgtgcgta cgcttgtggc 
tctgtgggac tttgtaagcg cgctggagca 
gctgttcctt atagtgcagc acagcaggga 
catagtagag cccgagggcc gctggctgct 
ggtgcaggag cgcagcttga gcctggctga 
tagcctgggc aagttttacg cccgcaagat 
ggaggtaaag atcgaggggt tctacatgcg 
cgacctgggc gtttatcgca acgagcgcat 
cgagctcagc gaccgcgagc tgatgcacag 
cggcgataga gaggccgagt cctactttga 
ccgacgcgcc ctggaggcag ctggggccgg 
tggcaacgtc ggcggcgtgg aggaatatga 
cgagtactaa gcggtgatgt ttctgatcag 
cgggcggcgc tgcagagcca gccgtccggc 
atggaccgca tcatgtcgct gactgcgcgc 
gccaaccggc tctccgcaat tctggaagcg 
gagaaggtgc tggcgatcgt aaacgcgctg 
gccggcctgg tctacgacgc gctgcttcag 
cagaccaacc tggaccggct ggtgggggat 
gcgcagcagc agggcaacct gggctccatg 
cccgccaacg tgccgcgggg acaggaggac 
atggtgactg agacaccgca aagtgaggtg 
accagtagac aaggcctgca gaccgtaaac 
ctgtgggggg tgcgggctcc cacaggcgac 
aactcgcgcc tgttgctgct gctaatagcg 
gacacatacc taggtcactt gctgacactg 
gacgagcata ctttccagga gattacaagt 
ggcagcctgg aggcaaccct aaactacctg 
ttgcacagtt taaacagcga ggaggagcgc 
cttaacctga tgcgcgacgg ggtaacgccc 
atggaaccgg gcatgtatgc ctcaaaccgg 
catcgcgcgg ccgccgtgaa ccccgagtat 
ctaccgcccc ctggtttcta caccggggga 
ctctgggacg acatagacga cagcgtgttt 
caacagcgcg agcaggcaga ggcggcgctg 
ttgtccgatc taggcgctgc ggccccgcgg 
atagggtctc ttaccagcac tcgcaccacc 
ctaaacaact cgctgctgca gccgcagcgc 
aacgggatag agagcctagt ggacaagatg 
agggacgtgc caggcccgcg cccgcccacc 
ctggtgtggg aggacgatga ctcggcagac 
ggcaacccgt ttgcgcacct tcgccccagg 
atgatgcaaa ataaaaaact caccaaggcc 
cccttagtat gcggcgcgcg gcgatgtatg 
tggtgagcgc ggcgccagtg gcggcggcgc 
cgccgtttgt gcctccgcgg tacctgcggc 
ctgagttggc acccctattc gacaccaccc 
atgtggcatc cctgaactac cagaacgacc 
acaatgacta cagcccgggg gaggcaagca 
actggggcgg cgacctgaaa accatcctgc 
tgtttaccaa taagtttaag gcgcgggtga 
aggtggagct gaaatacgag tgggtggagt 
ccatgaccat agaccttatg aacaacgcga 
agaacggggt tctggaaagc gacatcgggg 
ggtttgaccc cgtcactggt cttgtcatgc 
cagacatcat tttgctgcca ggatgcgggg 
tgttgggcat ccgcaagcgg caacccttcc 
tggagggtgg taacattccc gcactgttgg 
atgacaccga acagggcggg ggtggcgcag 



ggcagcagat ggtgattacg aacccccgcg 11100 
ggagggcgag ggcctggcgc ggctaggagc 11160 
gctgaagcgt gatacgcgtg aggcgtacgt 11220 
gggagaggag cccgaggaga tgcgggatcg 11280 
tggcctgaat cgcgagcggt tgctgcgcga 11340 
gattagtccc gcgcgcgcac acgtggcggc 11400 
ggtgaaccag gagattaact ttcaaaaaag 114 60 
gcgcgaggag gtggctatag gactgatgca 11520 
aaacccaaat agcaagccgc tcatggcgca 11580 
caacgaggca ttcagggatg cgctgctaaa 11640 
cgatttgata aacatcctgc agagcatagt 11700 
caaggtggcc gccatcaact attccatgct 11760 
ataccatacc ccttacgttc ccatagacaa 11820 
catggcgctg aaggtgctta ccttgagcga 11880 
ccacaaggcc gtgagcgtga gccggcggcg 11940 
cctgcaaagg gccctggctg gcacgggcag 12000 
cgcgggcgct gacctgcgct gggccccaag 12060 
acctgggctg gcggtggcac ccgcgcgcgc 12120 
cgaggacgat gagtacgagc cagaggacgg 12180 
atgatgcaag acgcaacgga cccggcggtg 12240 
cttaactcca cggacgactg gcgccaggtc 12300 
aatcctgacg cgttccggca gcagccgcag 12360 
gtggtcccgg cgcgcgcaaa ccccacgcac 12420 
gccgaaaaca gggccatccg gcccgacgag 12480 
cgcgtggctc gttacaacag cggcaacgtg 12540 
gtgcgcgagg ccgtggcgca gcgtgagcgc 12600 
gttgcactaa acgccttcct gagtacacag 12660 
tacaccaact ttgtgagcgc actgcggcta 12720 
taccagtctg ggccagacta ttttttccag 12780 
ctgagccagg ctttcaaaaa cttgcagggg 12840 
cgcgcgaccg tgtctagctt gctgacgccc 12900 
cccttcacgg acagtggcag cgtgtcccgg 12960 
taccgcgagg ccataggtca ggcgcatgtg 13020 
gtcagccgcg cgctggggca ggaggacacg 13080 
ctgaccaacc ggcggcagaa gatcccctcg 13140 
attttgcgct acgtgcagca gagcgtgagc 13200 
agcgtggcgc tggacatgac cgcgcgcaac 13260 
ccgtttatca accgcctaat ggactacttg 13320 
ttcaccaatg ccatcttgaa cccgcactgg 13380 
ttcgaggtgc ccgagggtaa cgatggattc 13440 
tccccgcaac cgcagaccct gctagagttg 13500 
cgaaaggaaa gcttccgcag gccaagcagc 13560 
tcagatgcta gtagcccatt tccaagcttg 13620 
cgcccgcgcc tgctgggcga ggaggagtac 13680 
gaaaaaaacc tgcctccggc atttcccaac 13740 
agtagatgga agacgtacgc gcaggagcac 13800 
cgtcgtcaaa ggcacgaccg tcagcggggt 13860 
gacagcagcg tcotggattt gggagggagt 13920 
ctggggagaa tgttttaaaa aaaaaaaagc 13980 
atggcaccga gcgttggttt tcttgtattc 14040 
aggaaggtcc tcctccctcc tacgagagtg 14100 
tgggttctcc cttcgatgct cccctggacc 14160 
ctaccggggg gagaaacagc atccgttact 14220 
gtgtgtacct ggtggacaac aagtcaacgg 14280 
acagcaactt tctgaccacg gtcattcaaa 14340 
cacagaccat caatcttgac gaccggtcgc 14400 
ataccaacat gccaaatgtg aacgagttca 14460 
tggtgtcgcg cttgcctact aaggacaatc 14520 
tcacgctgcc cgagggcaac tactccgaga 14580 
tcgtggagca ctacttgaaa gtgggcagac 14640 
taaagtttga cacccgcaac ttcagactgg 14700 
ctggggtata tacaaacgaa gccttccatc 14760 
tggacttcac ccacagccgc ctgagcaact 14820 
aggagggctt taggatcacc tacgatgatc 14880 
atgtggacgc ctaccaggcg agcttgaaag 14940 
gcggcagcaa cagcagtggc agcggcgcgg 15000 
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aagagaactc caacgcggca gccgcggcaa tgcagccggt ggaggacatg aacgatcatg 1S060 
ccattcgcgg cgacaccttt gccacacggg ctgaggagaa gcgcgctgag gccgaagcag 15120 
cggccgaagc tgccgccccc gctgcgcaac ccgaggtcga gaagcctcag aagaaaccgg 15180 
tgatcaaacc cctgacagag gacagcaaga aacgcagtta caacctaata agcaatgaca 15240 
gcaccttcac ccagtaccgc agctggtacc ttgcatacaa ctacggcgac cctcagaccg 15300 
gaatccgctc atggaccctg ctttgcactc ctgacgtaac ctgcggctcg gagcaggtct 15360 
actggtcgtt gccagacatg atgcaagacc ccgtgacctt ccgctccacg cgccagatca 15420 
gcaactttcc ggtggtgggc gccgagctgt tgcccgtgca ctccaagagc ttctacaacg 15480 
accaggccgt ctactcccaa ctcatccgcc agtttacctc tctgacccac gtgttcaatc 15540 
gctttcccga gaaccagatt ttggcgcgcc cgccagcccc caccatcacc accgtcagtg 15600 
aaaacgttcc tgctctcaca gatcacggga cgctaccgct gcgcaacagc atcggaggag 15660 
tccagcgagt gaccattact gacgccagac gccgcacctg cccctacgtt tacaaggccc 15720 
tgggcatagt ctcgccgcgc gtcctatcga gccgcacttt ttgagcaagc atgtccatcc 15780 
ttatatcgcc cagcaataac acaggctggg gcctgcgctt cccaagcaag atgtttggcg 15840 
gggccaagaa gcgctccgac caacacccag tgcgcgtgcg cgggcactac cgcgcgccct 15900 
ggggcgcgca caaacgcggc cgcactgggc gcaccaccgt cgatgacgcc atcgacgcgg 15960 
tggtggagga ggcgcgcaac tacacgccca cgccgccacc agtgtccaca gtggacgcgg 16020 
ccattcagac cgtggtgcgc ggagcccggc gctatgctaa aatgaagaga cggcggaggc 16080 
gcgtagcacg tcgccaccgc cgccgacccg gcactgccgc ccaacgcgcg gcggcggccc 16140 
tgcttaaccg cgcacgtcgc accggccgac gggcggccat gcgggccgct cgaaggctgg 16200 
ccgcgggtat tgtcactgtg ccccccaggt ccaggcgacg agcggccgcc gcagcagccg 16260 
cggccattag tgctatgact cagggtcgca ggggcaacgt gtattgggtg cgcgactcgg 16320 
ttagcggcct gcgcgtgccc gtgcgcaccc gccccccgcg caactagatt gcaagaaaaa 16380 
actacttaga ctcgtactgt tgtatgtatc cagcggcggc ggcgcgcaac gaagctatgt 16440 
ccaagcgcaa aatcaaagaa gagatgctcc aggtcatcgc gccggagatc tatggccccc 16500 
cgaagaagga agagcaggat tacaagcccc gaaagctaaa gcgggtcaaa aagaaaaaga 16560 
aagatgatga tgatgaactt gacgacgagg tggaactgct gcacgctacc gcgcccaggc 16620 
gacgggtaca gtggaaaggt cgacgcgtaa aacgtgtttt gcgacccggc accaccgtag 16680 
tctttacgcc cggtgagcgc tccacccgca cctacaagcg cgtgtatgat gaggtgtacg 16740 
gcgacgagga cctgcttgag caggccaacg agcgcctcgg ggagtttgcc tacggaaagc 16800 
ggcataagga catgctggcg ttgccgctgg acgagggcaa cccaacacct agcctaaagc 16860 
ccgtaacact gcagcaggtg ctgcccgcgc ttgcaccgtc cgaagaaaag cgcggcctaa 16920 
agcgcgagtc tggtgacttg gcacccaccg tgcagctgat ggtacccaag cgccagcgac 16980 
tggaagatgt cttggaaaaa atgaccgtgg aacctgggct ggagcccgag gtccgcgtgc 17040 
ggccaatcaa gcaggtggcg ccgggactgg gcgtgcagac cgtggacgtt cagataccca 17100 
ctaccagtag caccagtatt gccaccgcca cagagggcat ggagacacaa acgtccccgg 17160 
ttgcctcagc ggtggcggat gccgcggtgc aggcggtcgc tgcggccgcg tccaagacct 17220 
ctacggaggt gcaaacggac ccgtggatgt ttcgcgtttc agccccccgg cgcccgcgcg 17280 
gttcgaggaa gtacggcgcc gccagcgcgc tactgcccga atatgcccta catccttcca 17340 
ttgcgcctac ccccggctat cgtggctaca cctaccgccc cagaagacga gcaactaccc 17400 
gacgccgaac caccactgga acccgccgcc gccgtcgccg tcgccagccc gtgctggccc 174 60 
cgatttccgt gcgcagggtg gctcgcgaag gaggcaggac cctggtgctg ccaacagcgc 17520 
gctaccaccc cagcatcgtt taaaagccgg tctttgtggt tcttgcagat atggccctca 17580 
cctgccgcct ccgtttcccg gtgccgggat tccgaggaag aatgcaccgt aggaggggca 17640 
tggccggcca cggcctgacg ggcggcatgc gtcgtgcgca ccaccggcgg cggcgcgcgt 17700 
cgcaccgtcg catgcgcggc ggtatcctgc ccctccttat tccactgatc gccgcggcga 17760 
ttggcgccgt gcccggaatt gcatccgtgg ccttgcaggc gcagagacac tgattaaaaa 17820 
caagttgcat gtggaaaaat caaaataaaa agtctggact ctcacgctcg cttggtcctg 17880 
taactatttt gtagaatgga agacatcaac tttgcgtctc tggccccgcg acacggctcg 17940 
cgcccgttca tgggaaactg gcaagatatc ggcaccagca atatgagcgg tggcgccttc 18000 
agctggggct cgctgtggag cggcattaaa aatttcggtt ccaccgttaa gaactatggc 18060 
agcaaggcct ggaacagcag cacaggccag atgctgaggg ataagttgaa agagcaaaat 18120 
ttccaacaaa aggtggtaga tggcctggcc tctggcatta gcggggtggt ggacctggcc 18180 
aaccaggcag tgcaaaataa gattaacagt aagcttgatc cccgccctcc cgtagaggag 18240 
cctccaccgg ccgtggagac agtgtctcca gaggggcgtg gcgaaaagcg tccgcgcccc 18300 
gacagggaag aaactctggt gacgcaaata gacgagcctc cctcgtacga ggaggcacta 18360 
aagcaaggcc tgcccaccac ccgtcccatc gcgcccatgg ctaccggagt gctgggccag 18420 
cacacacccg taacgctgga cctgcctccc cccgccgaca cccagcagaa acctgtgctg 18480 
ccaggcccga ccgccgttgt tgtaacccgt cctagccgcg cgtccctgcg ccgcgccgcc 18540 
agcggtccgc gatcgttgcg gcccgtagcc agtggcaact ggcaaagcac actgaacagc 18600 
atcgtgggtc tgggggtgca atccctgaag cgccgacgat gcttctgaat agctaacgtg 18660 
tcgtatgtgt gtcatgtatg cgtccatgtc gccgccagag gagctgctga gccgccgcgc 18720 
gcccgctttc caagatggct accccttcga tgatgccgca gtggtcttac atgcacatct 18780 
cgggccagga cgcctcggag tacctgagcc ccgggctggt gcagtttgcc cgcgccaccg 18840 
agacgtactt cagcctgaat aacaagttta gaaaccccac ggtggcgcct acgcacgacg 18900 
tgaccacaga ccggtcccag cgtttgacgc tgcggttcat ccctgtggac cgtgaggata 18960 
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ctgcgtactc gtacaaggcg cggttcaccc tagctgtggg tgataaccgt gtgctggaca 19020 
tggcttccac gtactttgac atccgcggcg tgctggacag gggccctact tttaagccct 19080 
actctggcac tgcctacaac gccctggctc ccaagggtgc cccaaatcct tgcgaatggg 19140 
atgaagctgc tactgctctt gaaataaacc tagaagaaga ggacgatgac aacgaagacg 19200 
aagtagacga gcaagctgag cagcaaaaaa ctcacgtatt tgggcaggcg ccttattctg 19260 
gtataaatat tacaaaggag ggtattcaaa taggtgtcga aggtcaaaca cctaaatatg 19320 
ccgataaaac atttcaacct gaacctcaaa taggagaatc tcagtggtac gaaactgaaa 19380 
ttaatcatgc agctgggaga gtccttaaaa agactacccc aatgaaacca tgttacggtt 19440 
catatgcaaa acccacaaat gaaaatggag ggcaaggcat tcttgtaaag caacaaaatg 19500 
gaaagctaga aagtcaagtg gaaatgcaat ttttctcaac tactgaggcg accgcaggca 19560 
atggtgataa cttgactcct aaagtggtat tgtacagtga agatgtagat atagaaaccc 19620 
cagacactca tatttcttac atgcccacta ttaaggaagg taactcacga gaactaatgg 19680 
gccaacaatc tatgcccaac aggcctaatt acattgcttt tagggacaat tttattggtc 19740 
taatgtatta caacagcacg ggtaatatgg gtgttctggc gggccaagca tcgcagttga 19800 
atgctgttgt agatttgcaa gacagaaaca cagagctttc ataccagctt ttgcttgatt 19860 
ccattggtga tagaaccagg tacttttcta tgtggaatca ggctgttgac agctatgatc 19920 
cagatgttag aattattgaa aatcatggaa ctgaagatga acttccaaat tactgctttc 19980 
cactgggagg tgtgattaat acagagactc ttaccaaggt aaaacctaaa acaggtcagg 20040 
aaaatggatg ggaaaaagat gctacagaat tttcagataa aaatgaaata agagttggaa 20100 
ataattttgc catggaaatc aatctaaatg ccaacctgtg gagaaatttc ctgtactcca 20160 
acatagcgct gtatttgccc gacaagctaa agtacagtcc ttccaacgta aaaatttctg 20220 
ataacccaaa cacctacgac tacatgaaca agcgagtggt ggctcccggg ttagtggact 20280 
gctacattaa ccttggagca cgctggtccc ttgactatat ggacaacgtc aacccattta 20340 
accaccaccg caatgctggc ctgcgctacc gctcaatgtt gctgggcaat ggtcgctatg 20400 
tgcccttcca catccaggtg cctcagaagt tctttgccat taaaaacctc cttctcctgc 20460 
cgggctcata cacctacgag tggaacttca ggaaggatgt taacatggtt ctgcagagct 20520 
ccctaggaaa tgacctaagg gttgacggag ccagcattaa gtttgatagc atttgccttt 20580 
acgccacctt cttccccatg gcccacaaca ccgcctccac gcttgaggcc atgcttagaa 20640 
acgacaccaa cgaccagtcc tttaacgact atctctccgc cgccaacatg ctctacccta 20700 
tacccgccaa cgctaccaac gtgcccatat ccatcccctc ccgcaactgg gcggctttcc 20760 
gcggctgggc cttcacgcgc cttaagacta aggaaacccc atcactgggc tcgggctacg 20820 
acccttatta cacctactct ggctctatac cctacctaga tggaaccttt tacctcaacc 20880 
acacctttaa gaaggtggcc attacctttg actcttctgt cagctggcct ggcaatgacc 20940 
gcctgcttac ccccaacgag tttgaaatta agcgctcagt tgacggggag ggttacaacg 21000 
ttgcccagtg taacatgacc aaagactggt tcctggtaca aatgctagct aactacaaca 21060 
ttggctacca gggcttctat atcccagaga gctacaagga ccgcatgtac tccttcttta 21120 
gaaacttcca gcccatgagc cgtcaggtgg tggatgatac taaatacaag gactaccaac 21180 
aggtgggcat cctacaccaa cacaacaact ctggatttgt tggctacctt gcccccacca 21240 
tgcgcgaagg acaggcctac cctgctaact tcccctatcc gcttataggc aagaccgcag 21300 
ttgacagcat tacccagaaa aagtttcttt gcgatcgcac cctttggcgc atcccattct 21360 
ccagtaactt tatgtccatg ggcgcactca cagacctggg ccaaaacctt ctctacgcca 21420 
actccgccca cgcgctagac atgacttttg aggtggatcc catggacgag cccacccttc 21480 
tttatgtttt gtttgaagtc tttgacgtgg tccgtgtgca ccggccgcac cgcggcgtca 21540 
tcgaaaccgt gtacctgcgc acgcccttct cggccggcaa cgccacaaca taaagaagca 21600 
agcaacatca acaacagctg ccgccatggg ctccagtgag caggaactga aagccattgt 21660 
caaagatctt ggttgtgggc catatttttt gggcacctat gacaagcgct ttccaggctt 21720 
tgtttctcca cacaagctcg cctgcgccat agtcaatacg gccggtcgcg agactggggg 21780 
cgtacactgg atggcctttg cctggaaccc gcactcaaaa acatgctacc tctttgagcc 21840 
ctttggcttt tctgaccagc gactcaagca ggtttaccag tttgagtacg agtcactcct 21900 
gcgccgtagc gccattgctt cttcccccga ccgctgtata acgctggaaa agtccaccca 21960 
aagcgtacag gggcccaact cggccgcctg tggactattc tgctgcatgt ttctccacgc 22020 
ctttgccaac tggccccaaa ctcccatgga tcacaacccc accatgaacc ttattaccgg 22080 
ggtacccaac tccatgctca acagtcccca ggtacagccc accctgcgtc gcaaccagga 22140 
acagctctac agcttcctgg agcgccactc gccctacttc cgcagccaca gtgcgcagat 22200 
taggagcgcc acttcttttt gtcacttgaa aaacatgtaa aaataatgta ctagagacac 22260 
tttcaataaa ggcaaatgct tttatttgta cactctcggg tgattattta cccccaccct 22320 
tgccgtctgc gccgtttaaa aatcaaaggg gttctgccgc gcatcgctat gcgccactgg 22380 
cagggacacg ttgcgatact ggtgtttagt gctccactta aactcaggca caaccatccg 22440 
cggcagctcg gtgaagtttt cactccacag gctgcgcacc atcaccaacg cgtttagcag 22500 
gtcgggcgcc gatatcttga agtcgcagtt ggggcctccg ccctgcgcgc gcgagttgcg 22560 
atacacaggg ttgcagcact ggaacactat cagcgccggg tggtgcacgc tggccagcac 22620 
gctcttgtcg gagatcagat ccgcgtccag gtcctccgcg ttgctcaggg cgaacggagt 22680 
caactttggt agctgccttc ccaaaaaggg cgcgtgccca ggctttgagt tgcactcgca 22740 
ccgtagtggc atcaaaaggt gaccgtgccc ggtctgggcg ttaggataca gcgcctgcat 22800 
aaaagccttg atctgcttaa aagccacctg agcctttgcg ccttcagaga agaacatgcc 22860 
gcaagacttg ccggaaaact gattggccgg acaggccgcg tcgtgcacgc agcaccttgc 22920 
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gtcggtgttg gagatctgca ccacatttcg gccccaccgg ttcttcacga tcttggcctt 22980 
gctagactgc tccttcagcg cgcgctgccc gttttcgctc gtcacatcca tttcaatcac 23040 
gtgctcctta tttatcataa tgcttccgtg tagacactta agctcgcctt cgatctcagc 23100 
gcagcggtgc agccacaacg cgcagcccgt gggctcgtga tgcttgtagg tcacctctgc 23160 
aaacgactgc aggtacgcct gcaggaatcg ccccatcatc gtcacaaagg tcttgttgct 23220 
ggtgaaggtc agctgcaacc cgcggtgctc ctcgttcagc caggtcttgc atacggccgc 23280 
cagagcttcc acttggtcag gcagtagttt gaagttcgcc tttagatcgt tatccacgtg 23340 
gtacttgtcc atcagcgcgc gcgcagcctc catgcccttc tcccacgcag acacgatcgg 23400 
cacactcagc gggttcatca ccgtaatttc actttccgct tcgctgggct cttcctcttc 234 60 
ctcttgcgtc cgcataccac gcgccactgg gtcgtcttca ttcagccgcc gcactgtgcg 23520 
cttacctcct ttgccatgct tgattagcac cggtgggttg ctgaaaccca ccatttgtag 23580 
cgccacatct tctctttctt cctcgctgtc cacgattacc tctggtgatg gcgggcgctc 23640 
gggcttggga gaagggcgct tctttttctt cttgggcgca atggccaaat ccgccgccga 23700 
ggtcgatggc cgcgggctgg gtgtgcgcgg caccagcgcg tcttgtgatg agtcttcctc 23760 
gtcctcggac tcgatacgcc gcctcatccg cttttttggg ggcgcccggg gaggcggcgg 23820 
cgacggggac ggggacgaca cgtcctccat ggttggggga cgtcgcgccg caccgcgtcc 23880 
gcgctcgggg gtggtttcgc gctgctcctc ttcccgactg gccatttcct tctcctatag 23940 
gcagaaaaag atcatggagt cagtcgagaa gaaggacagc ctaaccgccc cctctgagtt 24000 
cgccaccacc gcctccaccg atgccgccaa cgcgcctacc accttccccg tcgaggcacc 24060 
cccgcttgag gaggaggaag tgattatcga gcaggaccca ggttttgtaa gcgaagacga 24120 
cgaggaccgc tcagtaccaa cagaggataa aaagcaagac caggacaacg cagaggcaaa 24180 
cgaggaacaa gtcgggcggg gggacgaaag gcatggcgac tacctagatg tgggagacga 24240 
cgtgctgttg aagcatctgc agcgccagtg cgccattatc tgcgacgcgt tgcaagagcg 24300 
cagcgatgtg cccctcgcca tagcggatgt cagccttgcc tacgaacgcc acctattctc 24360 
accgcgcgta ccccccaaac gccaagaaaa cggcacatgc gagcccaacc cgcgcctcaa 24420 
cttctacccc gtatttgccg tgccagaggt gcttgccacc tatcacatct ttttccaaaa 24480 
ctgcaagata cccctatcct gccgtgccaa ccgcagccga gcggacaagc agctggcctt 24540 
gcggcagggc gctgtcatac ctgatatcgc ctcgctcaac gaagtgccaa aaatctttga 24600 
gggtcttgga cgcgacgaga agcgcgcggc aaacgctctg caacaggaaa acagcgaaaa 24 660 
tgaaagtcac tctggagtgt tggtggaact cgagggtgac aacgcgcgcc tagccgtact 24720 
aaaacgcagc atcgaggtca cccactttgc ctacccggca cttaacctac cccccaaggt 24780 
catgagcaca gtcatgagtg agctgatcgt gcgccgtgcg cagcccctgg agagggatgc 24840 
aaatttgcaa gaacaaacag aggagggcct acccgcagtt ggcgacgagc agctagcgcg 24900 
ctggcttcaa acgcgcgagc ctgccgactt ggaggagcga cgcaaactaa tgatggccgc 24960 
agtgctcgtt accgtggagc ttgagtgcat gcagcggttc tttgctgacc cggagatgca 25020 
gcgcaagcta gaggaaacat tgcactacac ctttcgacag ggctacgtac gccaggcctg 25080 
caagatctcc aacgtggagc tctgcaacct ggtctcctac cttggaattt tgcacgaaaa 25140 
ccgccttggg caaaacgtgc ttcattccac gctcaagggc gaggcgcgcc gcgactacgt 25200 
ccgcgactgc gtttacttat ttctatgcta cacctggcag acggccatgg gcgtttggca 25260 
gcagtgcttg gaggagtgca acctcaagga gctgcagaaa ctgctaaagc aaaacttgaa 25320 
ggacctatgg acggccttca acgagcgctc cgtggccgcg cacctggcgg acatcatttt 25380 
ccccgaacgc ctgcttaaaa ccctgcaaca gggtctgcca gacttcacca gtcaaagcat 25440 
gttgcagaac tttaggaact ttatcctaga gcgctcagga atcttgcccg ccacctgctg 25500 
tgcacttcct agcgactttg tgcccattaa gtaccgcgaa tgccctccgc cgctttgggg 25560 
ccactgctac cttctgcagc tagccaacta ccttgcctac cactctgaca taatggaaga 25620 
cgtgagcggt gacggtctac tggagtgtca ctgtcgctgc aacctatgca ccccgcaccg 25680 
ctccctggtt tgcaattcgc agctgcttaa cgaaagtcaa attatcggta cctttgagct 25740 
gcagggtccc tcgcctgacg aaaagtccgc ggctccgggg ttgaaactca ctccggggct 25800 
gtggacgtcg gcttaccttc gcaaatttgt acctgaggac taccacgccc acgagattag 25860 
gttctacgaa gaccaatccc gcccgccaaa tgcggagctt accgcctgcg tcattaccca 25920 
gggccacatt cttggccaat tgcaagccat caacaaagcc cgccaagagt ttctgctacg 25980 
aaagggacgg ggggtttact tggaccccca gtccggcgag gagctcaacc caatcccccc 26040 
gccgccgcag ccctatcagc agcagccgcg ggcccttgct tcccaggatg gcacccaaaa 26100 
agaagctgca gctgccgccg ccacccacgg acgaggagga atactgggac agtcaggcag 26160 
aggaggtttt ggacgaggag gaggaggaca tgatggaaga ctgggagagc ctagacgagg 26220 
aagcttccga ggtcgaagag gtgtcagacg aaacaccgtc accctcggtc gcattcccct 26280 
cgccggcgcc ccagaaatcg gcaaccggtt ccagcatggc tacaacctcc gctcctcagg 26340 
cgccgccggc actgcccgtt cgccgaccca accgtagatg ggacaccact ggaaccaggg 26400 
ccggtaagtc caagcagccg ccgccgttag cccaagagca acaacagcgc caaggctacc 26460 
gctcatggcg cgggcacaag aacgccatag ttgcttgctt gcaagactgt gggggcaaca 26520 
tctccttcgc ccgccgcttt cttctctacc atcacggcgt ggccttcccc cgtaacatcc 26580 
tgcattacta ccgtcatctc tacagcccat actgcaccgg cggcagcggc agcggcagca 26640 
acagcagcgg ccacacagaa gcaaaggcga ccggatagca agactctgac aaagcccaag 26700 
aaatccacag cggcggcagc agcaggagga ggagcgctgc gtctggcgcc caacgaaccc 26760 
gtatcgaccc gcgagcttag aaacaggatt tttcccactc tgtatgctat atttcaacag 26820 
agcaggggcc aagaacaaga gctgaaaata aaaaacaggt ctctgcgatc cctcacccgc 26880 
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agctgcctgt atcacaaaag cgaagatcag cttcggcgca cgctggaaga cgcggaggct 26940 

ctcttcagta aatactgcgc gctgactctt aaggactagt ttcgcgccct ttctcaaatt 27000 

taagcgcgaa aactacgtca tctccagcgg ccacacccgg cgccagcacc tgtcgtcagc 27060 

gccattatga gcaaggaaat tcccacgccc tacatgtgga gttaccagcc acaaatggga 27120 

cttgcggctg gagctgccca agactactca acccgaataa actacatgag cgcgggaccc 27180 

cacatgatat cccgggtcaa cggaatccgc gcccaccgaa accgaattct cttggaacag 27240 

gcggctatta ccaccacacc tcgtaataac cttaatcccc gtagttggcc cgctgccctg 27300 

gtgtaccagg aaagtcccgc tcccaccact gtggtacttc ccagagacgc ccaggccgaa 27360 

gttcagatga ctaactcagg ggcgcagctt gcgggcggct ttcgtcacag ggtgcggtcg 27420 

cccgggcagg gtataactca cctgacaatc agagggcgag gtattcagct caacgacgag 27480 

tcggtgagct cctcgcttgg tctccgtccg gacgggacat ttcagatcgg cggcgccggc 27540 

cgtccttcat tcacgcctcg tcaggcaatc ctaactctgc agacctcgtc ctctgagccg 27600 

cgctctggag gcattggaac tctgcaattt attgaggagt ttgtgccatc ggtctacttt 27660 

aaccccttct cgggacctcc cggccactat ccggatcaat ttattcctaa ctttgacgcg 27720 

gtaaaggact cggcggacgg ctacgactga atgttaagtg gagaggcaga gcaactgcgc 27780 

ctgaaacacc tggtccactg tcgccgccac aagtgctttg cccgcgactc cggtgagttt 27840 

tgctactttg aattgcccga ggatcatatc gagggcccgg cgcacggcgt ccggcttacc 27900 

gcccagggag agcttgcccg tagcctgatt cgggagttta cccagcgccc cctgctagtt 27960 

gagcgggaca ggggaccctg tgttctcact gtgatttgca actgtcctaa ccttggatta 28020 

catcaagatc tttgttgcca tctctgtgct gagtataata aatacagaaa ttaaaatata 28080 

ctggggctcc tatcgccatc ctgtaaacgc caccgtcttc acccgcccaa gcaaaccaag 28140 

gcgaacctta cctggtactt ttaacatctc tccctctgtg atttacaaca gtttcaaccc 28200 

agacggagtg agtctacgag agaacctctc cgagctcagc tactccatca gaaaaaacac 28260 

caccctcctt acctgccggg aacgtacgag tgcgtcaccg gccgctgcac cacacctacc 28320 

gcctgaccgt aaaccagact ttttccggac agacctcaat aactctgttt accagaacag 28380 

gaggtgagct tagaaaaccc ttagggtatt aggccaaagg cgcagctact gtggggttta 28440 

tgaacaattc aagcaactct acgggctatt ctaattcagg tttctctagg gttggggtta 28500 

ttctctgtct tgtgattctc tttattctta tactaacgct tct'ctgccta aggctcgccg 28560 

cctgctgtgt gcacatttgc atttattgtc agctttttaa acgctggggt cgccacccaa 28620 

gatgattagg tacataatcc taggtttact cacccttgcg tcagcccacg gtacttaatt 28680 

aacccaaaag gtggatttta aggagccagc ctgtaatgtt acattcgcag ctgaagctaa 28740 

tgagtgcacc actcttataa aatgcaccac agaacatgaa aagctgctta ttcgccacaa 28800 

aaacaaaatt ggcaagtatg ctgtttatgc tatttggcag ccaggtgaca ctacagagta 28860 

taatgttaca gttttccagg gtaaaagtca taaaactttt atgtatactt ttccatttta 28920 

tgaaatgtgc gacattacca tgtacatgag caaacagtat aagttgtggc ccccacaaaa 28980 

ttgtgtggaa aacactggca ctttctgctg cactgctatg ctaattacag tgctcgcttt 29040 

ggtctgtacc ctactctata ttaaatacaa aagcagacgc agctttattg aggaaaagaa 29100 

aatgccttaa tttactaagt tacaaagcta atgtcaccac taactgcttt actcgctgct 29160 

tgcaaaacaa attcaaaaag ttagcattat aattagaata ggatttaaac cccccggtca 29220 

tttcctgctc aataccattc ccctgaacaa ttgactctat gtgggatatg ctccagcgct 29280 

acaaccttga agtcaggctt cctggatgtc agcatctgac tttggccagc acctgtcccg 29340 

cggatttgtt ccagtccaac tacagcgacc caccctaaca gagatgacca acacaaccaa 29400 

cgcggccgcc gctaccggac ttacatctac cacaaataca ccccaagttt ctgcctttgt 29460 

caataactgg gataacttgg gcatgtggtg gttctccata gcgcttatgt ttgtatgcct 29520 

tattattatg tggctcatct gctgcctaaa gcgcaaacgc gcccgaccac ccatctatag 29580 

tcccatcatt gtgctacacc caaacaatga tggaatccat agattggacg gactgaaaca 29640 

catgttcttt tctcttacag tatgattaaa tgagacatga ttcctcgagt ttttatatta 29700 

ctgacccttg ttgcgctttt ttgtgcgtgc tccacattgg ctgcggtttc tcacatcgaa 29760 

gtagactgca ttccagcctt cacagtctat ttgctttacg gatttgtcac cctcacgctc 29820 

atctgcagcc tcatcactgt ggtcatcgcc tttatccagt gcattgactg ggtctgtgtg 29880 

cgctttgcat atctcagctg ctgccatgtt gtgttgctac catgttgttt tcatgtgttg 29940 

ctgccatgct cttgtcgcct tagatctctc tttatgtagt gttgtggtgt ctctcttgtc 30000 

gtgatgtgtg ttttgtccta tatattttaa tttttaatcc aaacccctgt ccccgcagag 30060 

gcctttgcgt tctggtaggc cgtcattgaa aactgactta actcgttaaa ttaaaaaaat 30120 

gtaaaaaata atggttgaga ctcagcccaa catcggcaga tgaggtggat tgagactcag 30180 

cccaacatcg gcagatgagg tggattgaga ctcaacccca acattggcag atgaggtgaa 30240 

ttagatgagg tggattgaga ctcatgaggg tggtatgagg gcccgacgtc cacaggtggg 30300 

agttgtgctt tacagtccaa cgtgcaggac gcttggcatt tgccagagaa caccaagatt 30360 

ggcaaattcg caactggcgc cctgtgctct tcacagacgg aaaaatgacc aaaatctgat 30420 

tatttttgta aaacggaaac cgaatgtccg acaaagttca tttgatgact tcccggtagg 30480 

tctgccctgc cgctgggccg acgccgtccg ggaattttac aaacgatttc ggacgtctag 30540 

cattcactca ccttgtcaag gacctgagga tctctgcacc cttattaaga ccctgtgcgg 30600 

tctcaaagat cttattccct ttaactaata aaaaaaaata ataaagcatc acttacttaa 30660 

aatcagttag caaatttctg tccagtttat tcagcagcac ctccttgccc tcctcccagc 30720 

tctggtattg cagcttcctc ctggctgcaa actttctcca caatctaaat ggaatgtcag 30780 

tttcctcctg ttcctgtxca tccgcaccca ctatcttcat gttgttgcag atgaagcgcg 30840 
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caagaccgtc tgaagatacc ttcaaccccg tgtatccata tgacacggaa accggtcctc 30900 
caactgtgcc ttttcttact cctccctttg tatcccccaa tgggtttcaa gagagtcccc 30960 
ctggggtact ctctttgcgc ctatccgaac ctctagttac ctccaatggc atgcttgcgc 31020 
tcaaaatggg caacggcctc tctctggacg aggccggcaa ccttacctcc caaaatgtaa 31080 
ccactgtgag cccacctctc aaaaaaacca agtcaaacat aaacctggaa atatctgcac 31140 
ccctcacagt tacctcagaa gccctaactg tggctgccgc cgcacctcta atggtcgcgg 31200 
gcaacacact caccatgcaa tcacaggccc cgctaaccgt gcacgactcc aaacttagca 31260 
ttgccaccca aggacccctc acagtgtcag aaggaaagct agccctgcaa acatcaggcc 31320 
ccctcaccac caccgatagc agtaccctta ctatcactgc ctcaccccct ctaactactg 31380 
ccactggtag cttgggcatt gacttgaaag agcccattta tacacaaaat ggaaaactag 314 40 
gactaaagta cggggctcct ttgcatgtaa cagacgacct aaacactttg accgtagcaa 31500 
ctggtccagg tgtgactatt aataatactt ccttgcaaac taaagttact ggagccttgg 31560 
gttttgattc acaaggcaat atgcaactta atgtagcagg aggactaagg attgattctc 31620 
aaaacagacg ccttatactt gatgttagtt atccgtttga tgctcaaaac caactaaatc 31680 
taagactagg acagggccct ctttttataa actcagccca caacttggat attaactaca 31740 
acaaaggcct ttacttgttt acagcttcaa acaattccaa aaagcttgag gttaacctaa 31800 
gcactgccaa ggggttgatg tttgacgcta cagccatagc cattaatgca ggagatgggc 31860 
ttgaatttgg ttcacctaat gcaccaaaca caaatcccct caaaacaaaa attggccatg 31920 
gcctagaatt tgattcaaac aaggctatgg ttcctaaact aggaactggc cttagttttg 31980 
acagcacagg tgccattaca gtaggaaaca aaaataatga taagctaact ttgtggacca 32040 
caccagctcc atctcctaac tgtagactaa atgcagagaa agatgctaaa ctcactttgg 32100 
tcttaacaaa atgtggcagt caaatacttg ctacagtttc agttttggct gttaaaggca 32160 
gtttggctcc aatatctgga acagttcaaa gtgctcatct tattataaga tttgacgaaa 32220 
atggagtgct actaaacaat tccttcctgg acccagaata ttggaacttt agaaatggag 32280 
atcttactga aggcacagcc tatacaaacg ctgttggatt tatgcctaac ctatcagctt 32340 
atccaaaatc tcacggtaaa actgccaaaa gtaacattgt cagtcaagtt tacttaaacg 32400 
gagacaaaac taaacctgta acactaacca ttacactaaa cggtacacag gaaacaggag 324 60 
acacaactcc aagtgcatac tctatgtcat tttcatggga ctggtctggc cacaactaca 32520 
ttaatgaaat atttgccaca tcctcttaca ctttttcata cattgcccaa gaataaagaa 32580 
tcgtttgtgt tatgtttcaa cgtgtttatt tttcaattgc agaaaatttc aagtcatttt 32640 
tcattcagta gtatagcccc accaccacat agcttataca gatcaccgta ccttaatcaa 32700 
actcacagaa ccctagtatt caacctgcca cctccctccc aacacacaga gtacacagtc 32760 
ctttctcccc ggctggcctt aaaaagcatc atatcatggg taacagacat attcttaggt 32820 
gttatattcc acacggtttc ctgtcgagcc aaacgctcat cagtgatatt aataaactcc 32880 
ccgggcagct cacttaagtt catgtcgctg tccagctgct gagccacagg ctgctgtcca 32940 
acttgcggtt gcttaacggg cggcgaagga gaagtccacg cctacatggg ggtagagtca 33000 
taatcgtgca tcaggatagg gcggtggtgc tgcagcagcg cgcgaataaa ctgctgccgc 33060 
cgccgctccg tcctgcagga atacaacatg gcagtggtct cctcagcgat gattcgcacc 33120 
gcccgcagca taaggcgcct tgtcctccgg gcacagcagc gcaccctgat ctcacttaaa 33180 
tcagcacagt aactgcagca cagcaccaca atattgttca aaatcccaca gtgcaaggcg 33240 
ctgtatccaa agctcatggc ggggaccaca gaacccacgt ggccatcata ccacaagcgc 33300 
aggtagatta agtggcgacc cctcataaac acgctggaca taaacattac ctcttttggc 33360 
atgttgtaat tcaccacctc ccggtaccat ataaacctct gattaaacat ggcgccatcc 33420 
accaccatcc taaaccagct ggccaaaacc tgcccgccgg ctatacactg cagggaaccg 33480 
ggactggaac aatgacagtg gagagcccag gactcgtaac catggatcat catgctcgtc 33540 
atgatatcaa tgttggcaca acacaggcac. acgtgcatac acttcctcag gattacaagc 33600 
tcctcccgcg ttagaaccat atcccaggga acaacccatt cctgaatcag cgtaaatccc 33660 
acactgcagg gaagacctcg cacgtaactc acgttgtgca ttgtcaaagt gttacattcg 33720 
ggcagcagcg gatgatcctc cagtatggta gcgcgggttt ctgtctcaaa aggaggtaga 33780 
cgatccctac tgtacggagt gcgccgagac aaccgagatc gtgttggtcg tagtgtcatg 33840 
ccaaatggaa cgccggacgt agtcatattt cctgaagcaa aaccaggtgc gggcgtgaca 33900 
aacagatctg cgtctccggt ctcgccgctt agatcgctct gtgtagtagt tgtagtatat 33960 
ccactctctc aaagcatcca ggcgccccct ggcttcgggt tctatgtaaa ctccttcatg 34020 
cgccgctgcc ctgataacat ccaccaccgc agaataagcc acacccagcc aacctacaca 34080 
ttcgttctgc gagtcacaca cgggaggagc gggaagagct ggaagaacca tgtttttttt 34140 
tttattccaa aagattatcc aaaacctcaa aatgaagatc tattaagtga acgcgctccc 34200 
ctccggtggc gtggtcaaac tctacagcca aagaacagat aatggcattt gtaagatgtt 34260 
gcacaatggc ttccaaaagg caaacggccc tcacgtccaa gtggacgtaa aggctaaacc 34320 
cttcagggtg aatctcctct ataaacattc cagcaccttc aaccatgccc aaataattct 34380 
catctcgcca ccttctcaat atatctctaa gcaaatcccg aatattaagt ccggccattg 34440 
taaaaatctg ctccagagcg ccctccacct tcagcctcaa gcagcgaatc atgattgcaa 34500 
aaattcaggt tcctcacaga cctgtataag attcaaaagc ggaacattaa caaaaatacc 34560 
gcgatcccgt aggtcccttc gcagggccag ctgaacataa tcgtgcaggt ctgcacggac 34620 
cagcgcggcc acttccccgc caggaacctt gacaaaagaa cccacactga ttatgacacg 34680 
catactcgga gctatgctaa ccagcgtagc cccgatgtaa gctttgttgc atgggcggcg 34740 
atataaaatg caaggtgctg ctcaaaaaat caggcaaagc ctcgcgcaaa aaagaaagca 34800 
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catcgtagtc atgctcatgc agataaaggc 

acaccatttt tctctcaaac atgtctgcgg 

aaaaaacatt taaacattag aagcctgtct 

aagacggact acggccatgc cggcgtgacc 

caccaccgac agctcctcgg tcatgtccgg 

aggttgattc atcggtcagt gctaaaaagc 

gcaggcgtag agacaacatt acagccccca 

aaaacacata aacacctgaa aaaccctcct 

gaacaacata cagcgcttca cagcggcagc 

aacctattaa aaaaacacca ctcgacacgg 

gggccaagtg cagagcgagt atatatagga 

aaaaaacacc cagaaaaccg cacgcgaacc 

caacttcctc aaatcgtcac ttccgttttc 

actacaattc ccaacacata caagttactc 

cccacgcccc gcgccacgtc acaaactcca 

aaataaggta tattattgat gatg 

<210> 14 
<211> 33988 
<212> DNA 

<213> Adenovirus subgroup C 



aggtaagctc cggaaccacc acagaaaaag 34860 
gtttctgcat aaacacaaaa taaaataaca 34 920 
tacaacagga aaaacaaccc ttataagcat 34980 
gtaaaaaaac tggtcaccgt gattaaaaag 35040 
agtcataatg taagactcgg taaacacatc 35100 
gaccgaaata gcccggggga atacataccc 35160 
taggaggtat aacaaaatta ataggagaga 35220 
gcctaggcaa aatagcaccc tcccgctcca 35280 
ctaacagtca gccttaccag taaaaaagaa 35340 
caccagctca atcagtcaca gtgtaaaaaa 35400 
ctaaaaaatg acgtaacggt taaagtccac 354 60 
tacgcccaga aacgaaagcc aaaaaaccca 35520 
ccacgttacg taacttccca ttttaagaaa 35580 
cgccctaaaa cctacgtcac ccgccccgtt 35640 
ccccctcatt atcatattgg cttcaatcca 35700 

35724 



<400> 14 

catcatcaat aatatacctt attttggatt 
ttgtgacgtg gcgcggggcg tgggaacggg 
gatgttgcaa gtgtggcgga acacatgtaa 
gtgtgcgccg gtgtacacag gaagtgacaa 
taaatttggg cgtaaccgag taagatttgg 
agtgaaatct gaataatttt gtgttactca 
gactttgacc gtttacgtgg agactcgccc 
cgggtcaaag ttggcgtttt attattatag 
tgagttcctc aagaggccac tcttgagtgc 
tccgacaccg ggactgaaaa tgagacatga 
ccattttgaa ccacctaccc ttcacgaact 
tcccaacgag gaggcggttt cgcagatttt 
agggattgac ttactcactt ttccgccggc 
ccggcagccc gagcagccgg agcagagagc 
tccacccagt gacgacgagg atgaagaggg 
ccccgggcac ggttgcaggt cttgtcatta 
tatgtgttcg ctttgctata tgaggacctg 
atgggcagtg ggtgatagag tggtgggttt 
gttttgtggt ttaaagaatt ttgtattgtg 
gagcctgagc ccgagccaga accggagcct 
cctgctatcc tgagacgccc gacatcacct 
agctgtgact ccggtccttc taacacacct 
cccattaaac cagttgccgt gagagttggt 
gacttgctta acgagcctgg gcaacctttg 
ggtgtaaacc tgtgattgcg tgtgtggtta 
agtttaataa agggtgagat aatgtttaac 
aaagggtata taatgcgccg tgggctaatc 
gagtgtttgg aagatttttc tgctgtgcgt 
tcttggtttt ggaggtttct gtggggctca 
gaggattaca agtgggaatt tgaagagctt 
ttgaatctgg gtcaccaggc gcttttccaa 
acaccggggc gcgctgcggc tgctgttgct 
gaagaaaccc atctgagcgg ggggtacctg 
gcggttgtga gacacaagaa tcgcctgcta 
ccgacggagg agcagcagca gcagcaggag 
ccatggaacc cgagagccgg cctggaccct 
tgtatccaga actgagacgc attttgacaa 
taaagaggga gcggggggct tgtgaggcta 
taatgaccag acaccgtcct gagtgtatta 
atgagcttga tctgctggcg cagaagtatt 
agccagggga tgattttgag gaggctatta 
attgcaagta caagatcagc aaacttgtaa 
acggggccga ggtggagata gatacggagg 



gaagccaata tgataatgag ggggtggagt 60 
gcgggtgacg tagtagtgtg gcggaagtgt 120 
gcgacggatg tggcaaaagt gacgtttttg 180 
ttttcgcgcg gttttaggcg gatgttgtag 240 
ccattttcgc gggaaaactg aataagagga 300 
tagcgcgtaa tatttgtcta gggccgcggg 360 
aggtgttttt ctcaggtgtt ttccgcgttc 420 
tcagctgacg tgtagtgtat ttatacccgg 480 
cagcgagtag agttttctcc tccgagccgc 540 
ggtactggct gataatcttc cacctcctag 600 
gtatgattta gacgtgacgg cccccgaaga 660 
tcccgactct gtaatgttgg cggtgcagga 720 
gcccggttct ccggagccgc ctcacctttc 780 
cttgggtccg gtttgccacg aggctggctt 840 
tgaggagttt gtgttagatt atgtggagca 900 
tcaccggagg aatacggggg acccagatat 960 
tggcatgttt gtctacagta agtgaaaatt 1020 
ggtgtggtaa tttttttttt aatttttaca 1080 
atttttttaa aaggtcctgt gtctgaacct 1140 
gcaagaccta cccgccgtcc taaaatggcg 1200 
gtgtctagag aatgcaatag tagtacggat 1260 
cctgagatac acccggtggt cccgctgtgc 1320 
gggcgtcgcc aggctgtgga atgtatcgag 1380 
gacttgagct gtaaacgccc caggccataa 1440 
acgcctttgt ttgctgaatg agttgatgta 1500 
ttgcatggcg tgttaaatgg ggcggggctt 1560 
ttggttacat ctgacctcat ggaggcttgg 1620 
aacttgctgg aacagagctc taacagtacc 1680 
tcccaggcaa agttagtctg cagaattaag 1740 
ttgaaatcct gtggtgagct gtttgattct 1800 
gagaaggtca tcaagacttt ggatttttcc 1860 
tttttgagtt ttataaagga taaatggagc 1920 
ctggattttc tggccatgca tctgtggaga 1980 
ctgttgtctt ccgtccgccc ggcgataata 2040 
gaagccaggc ggcggcggca ggagcagagc 2100 
cgggaatgaa tgttgtacag gtggctgaac 2160 
ttacagagga tgggcagggg ctaaaggggg 2220 
cagaggaggc taggaatcta gcttttagct 2280 
cttttcaaca gatcaaggat aattgcgcta 2340 
ccatagagca gctgaccact tactggctgc 2400 
gggtatatgc aaaggtggca cttaggccag 24 60 
atatcaggaa ttgttgctac atttctggga 2520 
atagggtggc ctttagatgt agcatgataa 2580 
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atatgtggcc gggggtgctt ggcatggacg 
gccccaattt tagcggtacg gttttcctgg 
gcttctatgg gtttaacaat acctgtgtgg 
gtgcctttta ctgctgctgg aagggggtgg 
agaaatgcct ctttgaaagg tgtaccttgg 
gccacaatgt ggcctccgac tgtggttgct 
agcataacat ggtatgtggc aactgcgagg 
acggcaactg tcacctgctg aagaccattc 
cagtgtttga gcataacata ctgacccgct 
tgttcctacc ttaccaatgc aatttgagtc 
tgtccaaggt gaacctgaac ggggtgtttg 
ggtacgatga gacccgcacc aggtgcagac 
accagcctgt gatgctggat gtgaccgagg 
gcacccgcgc tgagtttggc tctagcgatg 
ggcgtggctt aagggtggga aagaatatat 
gttttgcagc agccgccgcc gccatgagca 
catatttgac aacgcgcatg cccccatggg 
gcattgatgg tcgccccgtc ctgcccgcaa 
ctggaacgcc gttggagact gcagcctccg 
gcgggattgt gactgacttt gctttcctga 
catccgcccg cgatgacaag ttgacggctc 
aacttaatgt cgtttctcag cagctgttgg 
cttcctcccc tcccaatgcg gtttaaaaca 
ggatcaagca agtgtcttgc tgtctttatt 
accagcggtc tcggtcgttg agggtcctgt 
tctggatqtt cagatacatg ggcataagcc 
gagcttcatg ctgcggggtg gtgttgtaga 
ggtgcctaaa aatgtctttc agtagcaagc 
tgtttacaaa gcggttaagc tgggatgggt 
actgtatttt taggttggct atgttcccag 
gaaccaccag cacagtgtat ccggtgcact 
atgcgtggaa gaacttggag acgcccttgt 
taatgatggc aatgggccca cgggcggcgg 
cgtcatagtt gtgttccagg atgagatcgt 
gggtgccaga ctgcggtata atggttccat 
tttgcatttc ccacgctttg agttcagatg 
agaaaacggt ttccggggta ggggagatca 
gcgacttacc gcagccggtg ggcccgtaaa 
taagagagct gcagctgccg tcatccctga 
tgactcgcat gttttccctg accaaatccg 
gttcttgcaa ggaagcaaag tttttcaacg 
tgagcgtttg accaagcagt tccaggcggt 
ctcgatccag catatctcct cgtttcgcgg 
tcggtgctcg tccagacggg ccagggtcat 
cgtagtctgg gtcacggtga aggggtgcgc 
gaggctggtc ctgctggtgc tgaagcgctg 
gcatttgacc atggtgtcat agtccagccc 
gcccttggag gaggcgccgc acgaggggca 
cgcgagaaat accgattccg gggagtaggc 
gcattccacg agccaggtga gctctggccg 
ctttttgatg cgtttcttac ctctggtttc 
aaggctgtcc gtgtccccgt atacagactt 
gtcctcctcg tatagaaact cggaccactc 
gaaggaggct aagtgggagg ggtagcggtc 
ggtgtgaaga cacatgtcgc cctcttcggc 
ggccacgtga ccgggtgttc ctgaaggggg 
ctcactctct tccgcatcgc tgtctgcgag 
aaaagcgggc atgacttctg cgctaagatt 
attcacctgg cccgcggtga tgcctttgag 
aatctttttg ttgtcaagct tggtggcaaa 
ggcgatggag cgcagggttt ggtttttgtc 
tagctgcacg tattcgcgcg caacgcaccg 
gggcaccagg tgcacgcgcc aaccgcggtt 
tacctctccg cgtaggcgct cgttggtcca 
tggcggtagg gggtctagct gcgtctcgtc 
gggcagcagg cgcgcgtcga agtagtctat 



gggtggttat tatgaatgta aggtttactg 2640 
ccaataccaa ccttatccta cacggtgtaa 2700 
aagcctggac cgatgtaagg gttcggggct 2760 
tgtgtcgccc caaaagcagg gcttcaatta 2820 
gtatcctgtc tgagggtaac tccagggtgc 2880 
tcatgctagt gaaaagcgtg gctgtgatta 2940 
acagggcctc tcagatgctg acctgctcgg 3000 
acgtagccag ccactctcgc aaggcctggc 3060 
gttccttgjca tttgggtaac aggagggggg 3120 
acactaagat attgcttgag cccgagagca 3180 
acatgaccat gaagatctgg aaggtgctga 3240 
cctgcgagtg tggcggtaaa catattagga 3300 
agctgaggcc cgatcacttg gtgctggcct 3360 
aagatacaga ttgaggtact gaaatgtgtg 3420 
aaggtggggg tcttatgtag ttttgtatct 3480 
ccaactcgtt tgatggaagc attgtgagct 3540 
ccggggtgcg tcagaatgtg atgggctcca 3600 
actctactac cttgacctac gagaccgtgt 3660 
ccgccgcttc agccgctgca gccaccgccc 3720 
gcccgcttgc aagcagtgca gcttcccgtt 3780 
ttttggcaca attggattct ttgacccggg 3840 
atctgcgcca gcaggtttct gccctgaagg 3900 
taaataaaaa accagactct gtttggattt 3960 
taggggtttt gcgcgcgcgg taggcccggg 4020 
gtattttttc caggacgtgg taaaggtgac 4080 
cgtctctggg gtggaggtag caccactgca 4140 
tgatccagtc gtagcaggag cgctgggcgt 4200 
tgattgccag gggcaggccc ttggtgtaag 4260 
gcatacgtgg ggatatgaga tgcatcttgg 4320 
ccatatccct ccggggattc atgttgtgca 4380 
tgggaaattt gtcatgtagc ttagaaggaa 4 440 
gacctccaag attttccatg cattcgtcca 4500 
cctgggcgaa gatatttctg ggatcactaa 4560 
cataggccat ttttacaaag cgcgggcgga 4 620 
ccggcccagg ggcgtagtta ccctcacaga 4 680 
gggggatcat gtctacctgc ggggcgatga 4740 
gctgggaaga aagcaggttc ctgagcagct 4800 
tcacacctat taccgggtgc aactggtagt 4860 
gcaggggggc cacttcgtta agcatgtccc 4 920 
ccagaaggcg ctcgccgccc agcgatagca 4 980 
gtttgagacc gtccgccgta ggcatgcttt 5040 
cccacagctc ggtcacctgc tctacggcat 5100 
gttggggcgg ctttcgctgt acggcagtag 5160 
gtctttccac gggcgcaggg tcctcgtcag 5220 
tccgggctgc gcgctggcca gggtgcgctt 5280 
ccggtcttcg ccctgcgcgt cggccaggta 5340 
ctccgcggcg tggcccttgg cgcgcagctt 5400 
gtgcagactt ttgagggcgt agagcttggg 5460 
atccgcgccg caggccccgc agacggtctc 5520 
ttcggggtca aaaaccaggt ttcccccatg 5580 
catgagccgg tgtccacgct cggtgacgaa 5640 
gagaggcctg tcctcgagcg gtgttccgcg 5700 
tgagacaaag gctcgcgtcc aggccagcac 5760 
gttgtccact agggggtcca ctcgctccag 5820 
atcaaggaag gtgattggtt tgtaggtgta 5880 
gctataaaag ggggtggggg cgcgttcgtc 5940 
ggccagctgt tggggtgagt actccctctg 6000 
gtcagtttcc aaaaacgagg aggatttgat 6060 
ggtggccgca tccatctggt cagaaaagac 6120 
cgacccgtag agggcgttgg acagcaactt 6180 
gcgatcggcg cgctccttgg ccgcgatgtt 6240 
ccattcggga aagacggtgg tgcgctcgtc 6300 
gtgcagggtg acaaggtcaa cgctggtggc 6360 
gcagaggcgg ccgcccttgc gcgagcagaa 6420 
cggggggtct gcgtccacgg taaagacccc 6480 
cttgcatcct tgcaagtcta gcgcctgctg 6540 
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ccatgcgcgg gcggcaagcg cgcgctcgta tgggttgagt gggggacccc atggcatggg 6600 
qtgggtgagc gcggaggcgt acatgccgca aatgtcgtaa acgtagaggg gctctctgag 6660 
tattccaaga tatgtagggt agcatcttcc accgcggatg ctggcgcgca cgtaatcgta 6720 
tagttcgtgc gagggagcga ggaggtcggg accgaggttg ctacgggcgg gctgctctgc 6780 
tcggaagact atctgcctga agatggcatg tgagttggat gatatggttg gacgctggaa 6840 
gacgttgaag ctggcgtctg tgagacctac cgcgtcacgc acgaaggagg cgtaggagtc 6900 
gcgcagcttg ttgaccagct cggcggtgac ctgcacgtct agggcgcagt agtccagggt 6960 
ttccttgatg atgtcatact tatcctgtcc cttttttttc cacagctcgc ggttgaggac 7020 
aaactcttcg cggtctttcc agtactcttg gatcggaaac cxgtcggcct ccgaacggta 7080 
agagcctagc atgtagaact ggttgacggc ctggtaggcg cagcatccct tttctacggg 7140 
tagcgcgtat gcctgcgcgg ccttccggag cgaggtgtgg gtgagcgcaa aggtgtccct 7200 
gaccatgact ttgaggtact ggtatttgaa gtcagtgtcg tcgcatccgc cctgctccca 7260 
gagcaaaaag tccgtgcgct ttttggaacg cggatttggc agggcgaagg tgacatcgtt 7320 
gaagagtatc tttcccgcgc gaggcataaa gttgcgtgtg atgcggaagg gtcccggcac 7380 
ctcggaacgg ttgttaatta cctgggcggc gagcacgatc tcgtcaaagc cgttgatgtt 7440 
gtggcccaca atgtaaagtt ccaagaagcg cgggatgccc ttgatggaag gcaatttttt 7500 
aagttcctcg taggtgagct cttcagggga gctgagcccg tgctctgaaa gggcccagtc 7560 
tgcaagatga gggttggaag cgacgaatga gctccacagg tcacgggcca ttagcatttg 7620 
caggtggtcg cgaaaggtcc taaactggcg acctatggcc attttttctg gggtgatgca 7680 
gtagaaggta agcgggtctt gttcccagcg gtcccatcca aggttcgcgg ctaggtctcg 7740 
cgcggcagtc actagaggct catctccgcc gaacttcatg accagcatga agggcacgag 7800 
ctgcttccca aaggccccca tccaagtata ggtctctaca tcgtaggtga caaagagacg 7860 
ctcggtgcga ggatgcgagc cgatcgggaa gaactggatc tcccgccacc aattggagga 7920 
gtggctattg atgtggtgaa agtagaagtc cctgcgacgg gccgaacact cgtgctggct 7980 
tttgtaaaaa cgtgcgcagt actggcagcg gtgcacgggc tgtacatcct gcacgaggtt 804 0 
gacctgacga ccgcgcacaa ggaagcagag tgggaatttg agcccctcgc ctggcgggtt 8100 
tggctggtgg tcttctactt cggctgcttg tccttgaccg tctggctgct cgaggggagt 8160 
tacggtggat cggaccacca cgccgcgcga gcccaaagtc cagatgtccg cgcgcggcgg 8220 
tcggagcttg atgacaacat cgcgcagatg ggagctgtcc atggtctgga gctcccgcgg 8280 
cgtcaggtca ggcgggagct cctgcaggtt tacctcgcat agacgggtca gggcgcgggc 8340 
tagatccagg tgatacctaa tttccagggg ctggttggtg gcggcgtcga tggcttgcaa 8400 
gaggccgcat ccccgcggcg cgactacggt accgcgcggc gggcggtggg ccgcgggggt 8460 
gtccttggat gatgcatcta aaagcggtga cgcgggcgag cccccggagg tagggggggc 8520 
tccggacccg ccgggagagg gggcaggggc acgtcggcgc cgcgcgcggg caggagctgg 8580 
tgctgcgcgc gtaggttgct ggcgaacgcg acgacgcggc ggttgatctc ctgaatctgg 8640 
cgcctctgcg tgaagacgac gggcccggtg agcttgagcc tgaaagagag ttcgacagaa 8700 
tcaatttcgg tgtcgttgac ggcggcctgg cgcaaaatct cctgcacgtc tcctgagttg 8760 
tcttgatagg cgatctcggc catgaactgc tcgatctctt cctcctggag atctccgcgt 8820 
ccggctcgct ccacggtggc ggcgaggtcg ttggaaatgc gggccatgag ctgcgagaag 8880 
gcgttgaggc ctccctcgtt ccagacgcgg ctgtagacca cgcccccttc ggcatcgcgg 8940 
gcgcgcatga ccacctgcgc gagattgagc tccacgtgcc gggcgaagac ggcgtagttt 9000 
cgcaggcgct gaaagaggta gttgagggtg gtggcggtgt gttctgccac gaagaagtac 9060 
ataacccagc gtcgcaacgt ggattcgttg atatccccca aggcctcaag gcgctccatg 9120 
gcctcgtaga agtccacggc gaagttgaaa aactgggagt tgcgcgccga cacggttaac 9180 
tcctcctcca gaagacggat gagctcggcg acagtgtcgc gcacctcgcg ctcaaaggct 9240 
acaggggcct cttcttcttc ttcaatctcc tcttccataa gggcctcccc ttcttcttct 9300 , 
tctggcggcg gtgggggagg ggggacacgg cggcgacgac ggcgcaccgg gaggcggtcg 9360 
acaaagcgct cgatcatctc cccgcggcga cggcgcatgg tctcggtgac ggcgcggccg 9420 
ttctcgcggg ggcgcagttg gaagacgccg cccgtcatgt cccggttatg ggttggcggg 9480 
gggctgccat gcggcaggga tacggcgcta acgatgcatc tcaacaattg ttgtgtaggt 9540 
actccgccgc cgagggacct gagcgagtcc gcatcgaccg gatcggaaaa cctctcgaga 9600 
aaggcgtcta accagtcaca gtcgcaaggt aggctgagca ccgtggcggg cggcagcggg 9660 
cggcggtcgg ggttgtttct ggcggaggtg ctgctgatga tgtaattaaa gtaggcggtc 9720 
ttgagacggc ggatggtcga cagaagcacc atgtccttgg gtccggcctg ctgaatgcgc 9780 
aggcggtcgg ccatgcccca ggcttcgttt tgacatcggc gcaggtcttt gtagtagtct 9840 
tgcatgagcc tttctaccgg cacttcttct tctccttcct cttgtcctgc atctcttgca 9900 
tctatcgctg cggcggcggc ggagtttggc cgtaggtggc gccctcttcc tcccatgcgt 9960 
gtgaccccga agcccctcat cggctgaagc agggctaggt cggcgacaac gcgctcggct 10020 
aatatggcct gctgcacctg cgtgagggta gactggaagt catccatgtc cacaaagcgg 10080 
tggtatgcgc ccgtgttgat ggtgtaagtg cagttggcca taacggacca gttaacggtc 10140 
tggtgacccg gctgcgagag ctcggtgtac ctgagacgcg agtaagccct cgagtcaaat 10200 
acgtagtcgt tgcaagtccg caccaggtac tggtatccca ccaaaaagtg cggcggcggc 10260 
tggcggtaga ggggccagcg tagggtggcc ggggctccgg gggcgagatc ttccaacata 10320 
aggcgatgat atccgtagat gtacctggac atccaggtga tgccggcggc ggtggtggag 10380 
gcgcgcggaa agtcgcggac gcggttccag atgttgcgca gcggcaaaaa gtgctccatg 10440 
gtcgggacgc tctggccggt caggcgcgcg caatcgttga cgctctagcg tgcaaaagga 10500 
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gagcctgtaa gcgggcactc ttccgtggtc 

ggacgaccgg ggttcgagcc ccgtatccgg 

cgcgtgtcga acccaggtgt gcgacgtcag 

tccaggcgcg gcggctgctg cgctagcttt 

gttaggctgg aaagcgaaag cattaagtgg 

caagggttga gtcgcgggac ccccggttcg 

ggggtttgcc tccccgtcat gcaagacccc 

gccccttttt tgcttttccc agatgcatcc 

gcagcggcaa gagcaagagc agcggcagac 

gtcaggaggg gcgacatccg cggttgacgc 

gcgccgggcc cggcactacc tggacttgga 

gccctctcct gagcggtacc caagggtgca 

gccgcggcag aacctgtttc gcgaccgcga 

aaagttccac gcagggcgcg agctgcggca 

ggaggacttt gagcccgacg cgcgaaccgg 

cgccgacctg gtaaccgcat acgagcagac 

ctttaacaac cacgtgcgta cgcttgtggc 

tctgtgggac tttgtaagcg cgctggagca 

gctgttcctt atagtgcagc acagcaggga 

catagtagag cccgagggcc gctggctgct 

ggtgcaggag cgcagcttga gcctggctga 

tagcctgggc aagttttacg cccgcaagat 

ggaggtaaag atcgaggggt tctacatgcg 

cgacctgggc gtttatcgca acgagcgcat 

cgagctcagc gaccgcgagc tgatgcacag 

cggcgataga gaggccgagt cctactttga 

ccgacgcgcc ctggaggcag ctggggccgg 

tggcaacgtc ggcggcgtgg aggaatatga 

cgagtactaa gcggtgatgt ttctgatcag 

cgggcggcgc tgcagagcca gccgtccggc 

atggaccgca tcatgtcgct gactgcgcgc 

gccaaccggc tctccgcaat tctggaagcg 

gagaaggtgc tggcgatcgt aaacgcgctg 

gccggcctgg tctacgacgc gctgcttcag 

cagaccaacc tggaccggct ggtgggggat 

gcgcagcagc agggcaacct gggctccatg 

cccgccaacg tgccgcgggg acaggaggac 

atggtgactg agacaccgca aagtgaggtg 

accagtagac aaggcctgca gaccgtaaac 

ctgtgggggg tgcgggctcc cacaggcgac 

aactcgcgcc tgttgctgct gctaatagcg 

gacacatacc taggtcactt gctgacactg 

gacgagcata ctttccagga gattacaagt 

ggcagcctgg aggcaaccct aaactacctg 

ttgcacagtt taaacagcga ggaggagcgc 

cttaacctga tgcgcgacgg ggtaacgccc 

atggaaccgg gcatgtatgc ctcaaaccgg 

catcgcgcgg ccgccgtgaa ccccgagtat 

ctaccgcccc ctggtttcta caccggggga 

ctctgggacg acatagacga cagcgtgttt 

caacagcgcg agcaggcaga ggcggcgctg 

ttgtccgatc taggcgctgc ggccccgcgg 

atagggtctc ttaccagcac tcgcaccacc 

ctaaacaact cgctgctgca gccgcagcgc 

aacgggatag agagcctagt ggacaagatg 

agggacgtgc caggcccgcg cccgcccacc 

ctggtgtggg aggacgatga ctcggcagac 

ggcaacccgt ttgcgcacct tcgccccagg 

atgatgcaaa ataaaaaact caccaaggcc 

cccttagtat gcggcgcgcg gcgatgtatg 

tggtgagcgc ggcgccagtg gcggcggcgc 

cgccgtttgt gcctccgcgg tacctgcggc 

ctgagttggc acccctattc gacaccaccc 

atgtggcatc cctgaactac cagaacgacc 

acaatgacta cagcccgggg gaggcaagca 

actggggcgg cgacctgaaa accatcctgc 



tggtggataa attcgcaagg gtatcatggc 10560 
ccgtccgccg tgatccatgc ggttaccgcc 10620 
acaacggggg agtgctcctt ttggcttcct 10680 
tttggccact ggccgcgcgc agcgtaagcg 10740 
ctcgctccct gtagccggag ggttattttc 10800 
agtctcggac cggccggact gcggcgaacg 10860 
gcttgcaaat tcctccggaa acagggacga 10920 
ggtgctgcgg cagatgcgcc cccctcctca 10980 
atgcagggca ccctcccctc ctcctaccgc 11040 
ggcagcagat ggtgattacg aacccccgcg 11100 
ggagggcgag ggcctggcgc ggctaggagc 11160 
gctgaagcgt gatacgcgtg aggcgtacgt 11220 
gggagaggag cccgaggaga tgcgggatcg 11280 
tggcctgaat cgcgagcggt tgctgcgcga 11340 
gattagtccc gcgcgcgcac acgtggcggc 11400 
ggtgaaccag gagattaact ttcaaaaaag 11460 
gcgcgaggag gtggctatag gactgatgca 11520 
aaacccaaat agcaagccgc tcatggcgca 11580 
caacgaggca ttcagggatg cgctgctaaa 11640 
cgatttgata aacatcctgc agagcatagt 11700 
caaggtggcc gccatcaact attccatgct 11760 
ataccatacc ccttacgttc ccatagacaa 11820 
catggcgctg aaggtgctta ccttgagcga 11880 
ccacaaggcc gtgagcgtga gccggcggcg 11940 
cctgcaaagg gccctggctg gcacgggcag 12000 
cgcgggcgct gacctgcgct gggccccaag 12060 
acctgggctg gcggtggcac ccgcgcgcgc 12120 
cgaggacgat gagtacgagc cagaggacgg 12180 
atgatgcaag acgcaacgga cccggcggtg 12240 
cttaactcca cggacgactg gcgccaggtc 12300 
aatcctgacg cgttccggca gcagccgcag 12360 
gtggtcccgg cgcgcgcaaa ccccacgcac 12420 
gccgaaaaca gggccatccg gcccgacgag 12480 
cgcgtggctc gttacaacag cggcaacgtg 12540 
gtgcgcgagg ccgtggcgca gcgtgagcgc 12600 
gttgcactaa acgccttcct gagtacacag 12660 
tacaccaact ttgtgagcgc actgcggcta 12720 
taccagtctg ggccagacta ttttttccag 12780 
ctgagccagg ctttcaaaaa cttgcagggg 12840 
cgcgcgaccg tgtctagctt gctgacgccc 12900 
cccttcacgg acagtggcag cgtgtcccgg 12960 
taccgcgagg ccataggtca ggcgcatgtg 13020 
gtcagccgcg cgctggggca ggaggacacg 13080 
ctgaccaacc ggcggcagaa gatcccctcg 13140 
attttgcgct acgtgcagca gagcgtgagc 13200 
agcgtggcgc tggacatgac cgcgcgcaac 13260 
ccgtttatca accgcctaat ggactacttg 13320 
ttcaccaatg ccatcttgaa cccgcactgg 13380 
ttcgaggtgc ccgagggtaa cgatggattc 134 40 
tccccgcaac cgcagaccct gctagagttg 13500 
cgaaaggaaa gcttccgcag gccaagcagc 13560 
tcagatgcta gtagcccatt tccaagcttg 13620 
cgcccgcgcc tgctgggcga ggaggagtac 13680 
gaaaaaaacc tgcctccggc atttcccaac 13740 
agtagatgga agacgtacgc gcaggagcac 13800 
cgtcgtcaaa ggcacgaccg tcagcggggt 13860 
gacagcagcg tcctggattt gggagggagt 13920 
ctggggagaa tgttttaaaa aaaaaaaagc 13980 
atggqaccga gcgttggttt tcttgtattc 14040 
aggaaggtcc tcctccctcc tacgagagtg 14100 
tgggttctcc cttcgatgct cccctggacc 14160 
ctaccggggg gagaaacagc atccgttact 14220 
gtgtgtacct ggtggacaac aagtcaacgg 14280 
acagcaactt tctgaccacg gtcattcaaa 14340 
cacagaccat caatcttgac gaccggtcgc 14400 
ataccaacat gccaaatgtg aacgagttca 14460 
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tgtttaccaa taagtttaag gcgcgggtga 
aggtggagct gaaatacgag tgggtggagt 
ccatgaccat agaccttatg aacaacgcga 
agaacggggt tctggaaagc gacatcgggg 
ggtttgaccc cgtcactggt cttgtcatgc 
cagacatcat tttgctgcca ggatgcgggg 
tgttgggcat ccgcaagcgg caacccttcc 
tggagggtgg taacattccc gcactgttgg 
atgacaccga acagggcggg ggtggcgcag 
aagagaactc caacgcggca gccgcggcaa 
ccattcgcgg cgacaccttt gccacacggg 
cggccgaagc tgccgccccc gctgcgcaac 
tgatcaaacc cctgacagag gacagcaaga 
gcaccttcac ccagtaccgc agctggtacc 
gaatccgctc atggaccctg ctttgcactc 
actggtcgtt gccagacatg atgcaagacc 
gcaactttcc ggtggtgggc gccgagctgt 
accaggccgt ctactcccaa ctcatccgcc 
gctttcccga gaaccagatt ttggcgcgcc 
aaaacgttcc tgctctcaca gatcacggga 
tccagcgagt gaccattact gacgccagac 
tgggcatagt ctcgccgcgc gtcctatcga 
ttatatcgcc cagcaataac acaggctggg 
gggccaagaa gcgctccgac caacacccag 
ggggcgcgca caaacgcggc cgcactgggc 
tggtggagga ggcgcgcaac tacacgccca 
ccattcagac cgtggtgcgc ggagcccggc 
gcgtagcacg tcgccaccgc cgccgacccg 
tgcttaaccg cgcacgtcgc accggccgac 
ccgcgggtat tgtcactgtg ccccccaggt 
cggccattag tgctatgact cagggtcgca 
ttagcggcct gcgcgtgccc gtgcgcaccc 
actacttaga ctcgtactgt tgtatgtatc 
ccaagcgcaa aatcaaagaa gagatgctcc 
cgaagaagga agagcaggat tacaagcccc 
aagatgatga tgatgaactt gacgacgagg 
gacgggtaca gtggaaaggt cgacgcgtaa 
tctttacgcc cggtgagcgc tccacccgca 
gcgacgagga cctgcttgag caggccaacg 
ggcataagga catgctggcg ttgccgctgg 
ccgtaacact gcagcaggtg ctgcccgcgc 
agcgcgagtc tggtgacttg gcacccaccg 
tggaagatgt cttggaaaaa atgaccgtgg 
ggccaatcaa gcaggtggcg ccgggactgg 
ctaccagtag caccagtatt gccaccgcca 
ttgcctcagc ggtggcggat gccgcggtgc 
ctacggaggt gcaaacggac ccgtggatgt 
gttcgaggaa gtacggcgcc gccagcgcgc 
ttgcgcctac ccccggctat cgtggctaca 
gacgccgaac caccactgga acccgccgcc 
cgatttccgt gcgcagggtg gctcgcgaag 
' gctaccaccc cagcatcgtt taaaagccgg 
cctgccgcct ccgtttcccg gtgccgggat 
tggccggcca cggcctgacg ggcggcatgc 
cgcaccgtcg catgcgcggc ggtatcctgc 
ttggcgccgt gcccggaatt gcatccgtgg 
caagttgcat gtggaaaaat caaaataaaa 
taactatttt gtagaatgga agacatcaac 
cgcccgttca tgggaaactg gcaagatatc 
agctggggct cgctgtggag cggcattaaa 
agcaaggcct ggaacagcag cacaggccag 
ttccaacaaa aggtggtaga tggcctggcc 
aaccaggcag tgcaaaataa gattaacagt 
cctccaccgg ccgtggagac agtgtctcca 
gacagggaag aaactctggt gacgcaaata 
aagcaaggcc tgcccaccac ccgtcccatc 



tggtgtcgcg cttgcctact aaggacaatc 14520 
tcacgctgcc cgagggcaac tactccgaga 14580 
tcgtggagca ctacttgaaa gtgggcagac 14 640 
taaagtttga cacccgcaac ttcagactgg 14700 
ctggggtata tacaaacgaa gccttccatc 14760 
tggacttcac ccacagccgc ctgagcaact 14820 
aggagggctt taggatcacc tacgatgatc 14880 
atgtggacgc ctaccaggcg agcttgaaag 14940 
gcggcagcaa cagcagtggc agcggcgcgg 15000 
tgcagccggt ggaggacatg aacgatcatg 15060 
ctgaggagaa gcgcgctgag gccgaagcag 15120 
ccgaggtcga gaagcctcag aagaaaccgg 15180 
aacgcagtta caacctaata agcaatgaca 15240 
ttgcatacaa ctacggcgac cctcagaccg 15300 
ctgacgtaac ctgcggctcg gagcaggtct 15360 
ccgtgacctt ccgctccacg cgccagatca 15420 
tgcccgtgca ctccaagagc ttctacaacg 15480 
agtttacctc tctgacccac gtgttcaatc 15540 
cgccagcccc caccatcacc accgtcagtg 15600 
cgctaccgct gcgcaacagc atcggaggag 15660 
gccgcacctg cccctacgtt tacaaggccc 15720 
gccgcacttt ttgagcaagc atgtccatcc 15780 
gcctgcgctt cccaagcaag atgtttggcg 15840 
tgcgcgtgcg cgggcactac cgcgcgccct 15900 
gcaccaccgt cgatgacgcc atcgacgcgg 15960 
cgccgccacc agtgtccaca gtggacgcgg 16020 
gctatgctaa aatgaagaga cggcggaggc 16080 
gcactgccgc ccaacgcgcg gcggcggccc 16140 
gggcggccat gcgggccgct cgaaggctgg 16200 
ccaggcgacg agcggccgcc gcagcagccg 16260 
ggggcaacgt gtattgggtg cgcgactcgg 16320 
gccccccgcg caactagatt gcaagaaaaa 16380 
cagcggcggc ggcgcgcaac gaagctatgt 16440 
aggtcatcgc gccggagatc tatggccccc 16500 
gaaagctaaa gcgggtcaaa aagaaaaaga 16560 
tggaactgct gcacgctacc gcgcccaggc 16620 
aacgtgtttt gcgacccggc accaccgtag 16680 
cctacaagcg cgtgtatgat gaggtgtacg 16740 
agcgcctcgg ggagtttgcc tacggaaagc 16800 
acgagggcaa cccaacacct agcctaaagc 16860 
ttgcaccgtc cgaagaaaag cgcggcctaa 16920 
tgcagctgat ggtacccaag cgccagcgac 16980 
aacctgggct ggagcccgag gtccgcgtgc 17040 
gcgtgcagac cgtggacgtt cagataccca 17100 
cagagggcat ggagacacaa acgtccccgg 17160 
aggcggtcgc tgcggccgcg tccaagacct 17220 
ttcgcgtttc agccccccgg cgcccgcgcg 17280 
tactgcccga atatgcccta catccttcca 17340 
cctaccgccc cagaagacga gcaactaccc 17400 
gccgtcgccg tcgccagccc gtgctggccc 17460 
gaggcaggac cctggtgctg ccaacagcgc 17520 
tctttgtggt tcttgcagat atggccctca 17580 
tccgaggaag aatgcaccgt aggaggggca 17640 
gtcgtgcgca ccaccggcgg cggcgcgcgt 17700 
ccctccttat tccactgatc gccgcggcga 17760 
ccttgcaggc gcagagacac tgattaaaaa 17820 
agtctggact ctcacgctcg cttggtcctg 17880 
tttgcgtctc tggccccgcg acacggctcg 17940 
ggcaccagca atatgagcgg tggcgccttc 18000 
aatttcggtt ccaccgttaa gaactatggc 18060 
atgctgaggg ataagttgaa agagcaaaat 18120 
tctggcatta gcggggtggt ggacctggcc 18180 
aagcttgatc cccgccctcc cgtagaggag 18240 
gaggggcgtg gcgaaaagcg tccgcgcccc 18300 
gacgagcctc cctcgtacga ggaggcacta 18360 
gcgcccatgg ctaccggagt gctgggccag 18420 
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cacacacccg taacgctgga cctgcctccc cccgccgaca cccagcagaa acctgtgctg 184 80 
ccaggcccga ccgccgttgt tgtaacccgt cctagccgcg cgtccctgcg ccgcgccgcc 18540 
agcggtccgc gatcgttgcg gcccgtagcc agtggcaact ggcaaagcac actgaacagc 18600 
atcgtgggtc tgggggtgca atccctgaag cgccgacgat gcttctgaat agctaacgtg 18660 
tcgtatgtgt gtcatgtatg cgtccatgtc gccgccagag gagctgctga gccgccgcgc 18720 
gcccgctttc caagatggct accccttcga tgatgccgca gtggtcttac atgcacatct 18780 
cgggccagga cgcctcggag tacctgagcc ccgggctggt gcagtttgcc cgcgccaccg 18840 
agacgtactt cagcctgaat aacaagttta gaaaccccac ggtggcgcct acgcacgacg 18900 
tgaccacaga ccggtcccag cgtttgacgc tgcggttcat ccctgtggac cgtgaggata 18960 
ctgcgtactc gtacaaggcg cggttcaccc tagctgtggg tgataaccgt gtgctggaca 19020 
tggcttccac gtactttgac atccgcggcg tgctggacag gggccctact tttaagccct 19080 
actctggcac tgcctacaac gccctggctc ccaagggtgc cccaaatcct tgcgaatggg 19140 
atgaagctgc tactgctctt gaaataaacc tagaagaaga ggacgatgac aacgaagacg 19200 
aagtagacga gcaagctgag cagcaaaaaa ctcacgtatt tgggcaggcg ccttattctg 19260 
gtataaatat tacaaaggag ggtattcaaa taggtgtcga aggtcaaaca cctaaatatg 19320 
ccgataaaac atttcaacct gaacctcaaa taggagaatc tcagtggtac gaaactgaaa 19380 
ttaatcatgc agctgggaga gtccttaaaa agactacccc aatgaaacca tgttacggtt 194 40 
catatgcaaa acccacaaat gaaaatggag ggcaaggcat tcttgtaaag caacaaaatg 19500 
gaaagctaga aagtcaagtg gaaatgcaat ttttctcaac tactgaggcg accgcaggca 19560 
atggtgataa cttgactcct aaagtggtat tgtacagtga agatgtagat atagaaaccc 19620 
cagacactca tatttcttac atgcccacta ttaaggaagg taactcacga gaactaatgg 19680 
gccaacaatc tatgcccaac aggcctaatt acattgcttt tagggacaat tttattggtc 19740 
taatgtatta caacagcacg ggtaatatgg gtgttctggc gggccaagca tcgcagttga 19800 
atgctgttgt agatttgcaa gacagaaaca cagagctttc ataccagctt ttgcttgatt 19860 
ccattggtga tagaaccagg tacttttcta tgtggaatca ggctgttgac agctatgatc 19920 
cagatgttag aattattgaa aatcatggaa ctgaagatga acttccaaat tactgctttc 19980 
cactgggagg tgtgattaat acagagactc ttaccaaggt aaaacctaaa acaggtcagg 20040 
aaaatggatg ggaaaaagat gctacagaat tttcagataa aaatgaaata agagttggaa 20100 
ataattttgc catggaaatc aatctaaatg ccaacctgtg gagaaatttc ctgtactcca 20160 
acatagcgct gtatttgccc gacaagctaa agtacagtcc ttccaacgta aaaatttctg 20220 
ataacccaaa cacctacgac tacatgaaca agcgagtggt ggctcccggg ttagtggact 20280 
gctacattaa ccttggagca cgctggtccc ttgactatat ggacaacgtc aacccattta 20340 
accaccaccg caatgctggc ctgcgctacc gctcaatgtt gctgggcaat ggtcgctatg 20400 
tgcccttcca catccaggtg cctcagaagt tctttgccat taaaaacctc cttctcctgc 204 60 
cgggctcata cacctacgag tggaacttca ggaaggatgt taacatggtt ctgcagagct 20520 
ccctaggaaa tgacctaagg gttgacggag ccagcattaa gtttgatagc atttgccttt 20580 
acgccacctt cttccccatg gcccacaaca ccgcctccac gcttgaggcc atgcttagaa 20640 
acgacaccaa cgaccagtcc tttaacgact atctctccgc cgccaacatg ctctacccta 20700 
tacccgccaa cgctaccaac gtgcccatat ccatcccctc ccgcaactgg gcggctttcc 20760 
gcggctgggc cttcacgcgc cttaagacta aggaaacccc atcactgggc tcgggctacg 20820 
acccttatta cacctactct ggctctatac cctacctaga tggaaccttt tacctcaacc 20880 
acacctttaa gaaggtggcc attacctttg actcttctgt cagctggcct ggcaatgacc 20940 
gcctgcttac ccccaacgag tttgaaatta agcgctcagt tgacggggag ggttacaacg 21000 
ttgcccagtg taacatgacc aaagactggt tcctggtaca aatgctagct aactacaaca 21060 
ttggctacca gggcttctat atcccagaga gctacaagga ccgcatgtac tccttcttta 21120 
gaaacttcca gcccatgagc cgtcaggtgg tggatgatac taaatacaag gactaccaac 21180 
aggtgggcat cctacaccaa cacaacaact ctggatttgt tggctacctt gcccccacca 21240 
tgcgcgaagg acaggcctac cctgctaact tcccctatcc gcttataggc aagaccgcag 21300 
ttgacagcat tacccagaaa aagtttcttt gcgatcgcac cctttggcgc atcccattct 21360 
ccagtaactt tatgtccatg ggcgcactca cagacctggg ccaaaacctt ctctacgcca 21420 
actccgccca cgcgctagac atgacttttg aggtggatcc catggacgag cccacccttc 21460 
tttatgtttt gtttgaagtc tttgacgtgg tccgtgtgca ccggccgcac cgcggcgtca 21540 
tcgaaaccgt gtacctgcgc acgcccttct cggccggcaa cgccacaaca taaagaagca 21600 
agcaacatca acaacagctg ccgccatggg ctccagtgag caggaactga aagccattgt 21660 
caaagatctt ggttgtgggc catatttttt gggcacctat gacaagcgct ttccaggctt 21720 
tgtttctcca cacaagctcg cctgcgccat agtcaatacg gccggtcgcg agactggggg 21780 
cgtacactgg atggcctttg cctggaaccc gcactcaaaa acatgctacc tctttgagcc 21840 
ctttggcttt tctgaccagc gactcaagca ggtttaccag tttgagtacg agtcactcct 21900 
gcgccgtagc gccattgctt cttcccccga ccgctgtata acgctggaaa agtccaccca 21960 
aagcgtacag gggcccaact cggccgcctg tggactattc tgctgcatgt ttctccacgc 22020 
ctttgccaac tggccccaaa ctcccatgga tcacaacccc accatgaacc ttattaccgg 22080 
ggtacccaac tccatgctca acagtcccca ggtacagccc accctgcgtc gcaaccagga 22140 
acagctctac agcttcctgg agcgccactc gccctacttc cgcagccaca gtgcgcagat 22200 
taggagcgcc acttcttttt gtcacttgaa aaacatgtaa aaataatgta ctagagacac 22260 
tttcaataaa ggcaaatgct tttatttgta cactctcggg tgattattta cccccaccct 22320 
tgccgtctgc gccgtttaaa aatcaaaggg gttctgccgc gcatcgctat gcgccactgg 22380 
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cagggacacg ttgcgatact ggtgtttagt gctccactta aactcaggca caaccatccg 22 440 
cggcagctcg gtgaagtttt cactccacag gctgcgcacc atcaccaacg cgtttagcag 22500 
gtcgggcgcc gatatcttga agtcgcagtt ggggcctccg ccctgcgcgc gcgagttgcg 22560 
atacacaggg ttgcagcact ggaacactat cagcgccggg tggtgcacgc tggccagcac 22620 
gctcttgtcg gagatcagat ccgcgtccag gtcctccgcg ttgctcaggg cgaacggagt 22 680 
caactttggt agctgccttc ccaaaaaggg cgcgtgccca ggctttgagt tgcactcgca 22740 
ccgtagtggc atcaaaaggt gaccgtgccc ggtctgggcg ttaggataca gcgcctgcat 22800 
aaaagccttg atctgcttaa aagccacctg agcctttgcg ccttcagaga agaacatgcc 22860 
gcaagacttg ccggaaaact gattggccgg acaggccgcg tcgtgcacgc agcaccttgc 22920 
gtcggtgttg gagatctgca ccacatttcg gccccaccgg ttcttcacga tcttggcctt 22980 
gctagactgc tccttcagcg cgcgctgccc gttttcgctc gtcacatcca tttcaatcac 23040 
gtgctcctta tttatcataa tgcttccgtg tagacactta agctcgcctt cgatctcagc 23100 
gcagcggtgc agccacaacg cgcagcccgt gggctcgtga tgcttgtagg tcacctctgc 23160 
aaacgactgc aggtacgcct gcaggaatcg ccccatcatc gtcacaaagg tcttgttgct 23220 
ggtgaaggtc agctgcaacc cgcggtgctc ctcgttcagc caggtcttgc atacggccgc 23280 
cagagcttcc acttggtcag gcagtagttt gaagttcgcc tttagatcgt tatccacgtg 23340 
gtacttgtcc atcagcgcgc gcgcagcctc catgcccttc tcccacgcag acacgatcgg 23400 
cacactcagc gggttcatca ccgtaatttc actttccgct tcgctgggct cttcctcttc 234 60 
ctcttgcgtc cgcataccac gcgccactgg gtcgtcttca ttcagccgcc gcactgtgcg 23520 
cttacctcct ttgccatgct tgattagcac cggtgggttg ctgaaaccca ccatttgtag 23580 
cgccacatct tctctttctt cctcgctgtc cacgattacc tctggtgatg gcgggcgctc 23640 
gggcttggga gaagggcgct tctttttctt cttgggcgca atggccaaat ccgccgccga 23700 
ggtcgatggc cgcgggctgg gtgtgcgcgg caccagcgcg tcttgtgatg agtcttcctc 23760 
gtcctcggac tcgatacgcc gcctcatccg cttttttggg ggcgcccggg gaggcggcgg 23820 
cgacggggac ggggacgaca cgtcctccat ggttggggga cgtcgcgccg caccgcgtcc 23880 
gcgctcgggg gtggtttcgc gctgctcctc ttcccgactg gccatttcct tctcctatag. 23940 
gcagaaaaag atcatggagt cagtcgagaa gaaggacagc ctaaccgccc cctctgagtt 24000 
cgccaccacc gcctccaccg atgccgccaa cgcgcctacc accttccccg tcgaggcacc 24060 
cccgcttgag gaggaggaag tgattatcga gcaggaccca ggttttgtaa gcgaagacga 24120 
cgaggaccgc tcagtaccaa cagaggataa aaagcaagac caggacaacg cagaggcaaa 24180 
cgaggaacaa gtcgggcggg gggacgaaag gcatggcgac tacctagatg tgggagacga 24240 
cgtgctgttg aagcatctgc agcgccagtg cgccattatc tgcgacgcgt tgcaagagcg 24300 
cagcgatgtg cccctcgcca tagcggatgt cagccttgcc tacgaacgcc acctattctc 24360 
accgcgcgta ccccccaaac gccaagaaaa cggcacatgc gagcccaacc cgcgcctcaa 24420 
cttctacccc gtatttgccg tgccagaggt gcttgccacc tatcacatct ttttccaaaa 24480 
ctgcaagata cccctatcct gccgtgccaa ccgcagccga gcggacaagc agctggcctt 24540 
gcggcagggc gctgtcatac ctgatatcgc ctcgctcaac gaagtgccaa aaatctttga 24 600 
gggtcttgga cgcgacgaga agcgcgcggc aaacgctctg caacaggaaa acagcgaaaa 24 660 
tgaaagtcac tctggagtgt tggtggaact cgagggtgac aacgcgcgcc tagccgtact 24720 
aaaacgcagc atcgaggtca cccactttgc ctacccggca cttaacctac cccccaaggt 24780 
catgagcaca gtcatgagtg agctgatcgt gcgccgtgcg cagcccctgg agagggatgc 24840 
aaatttgcaa gaacaaacag aggagggcct acccgcagtt ggcgacgagc agctagcgcg 24900 
ctggcttcaa acgcgcgagc ctgccgactt ggaggagcga cgcaaactaa tgatggccgc 24 960 
agtgctcgtt accgtggagc ttgagtgcat gcagcggttc tttgctgacc cggagatgca 25020 
gcgcaagcta gaggaaacat tgcactacac ctttcgacag ggctacgtac gccaggcctg 25080 
caagatctcc aacgtggagc tctgcaacct ggtctcctac cttggaattt tgcacgaaaa 25140 
ccgccttggg caaaacgtgc ttcattccac gctcaagggc gaggcgcgcc gcgactacgt 25200 
ccgcgactgc gtttacttat ttctatgcta cacctggcag acggccatgg gcgtttggca 25260 
gcagtgcttg gaggagtgca acctcaagga gctgcagaaa ctgctaaagc aaaacttgaa 25320 
ggacctatgg acggccttca acgagcgctc cgtggccgcg cacctggcgg acatcatttt 25380 
ccccgaacgc ctgcttaaaa ccctgcaaca gggtctgcca gacttcacca gtcaaagcat 25440 
gttgcagaac tttaggaact ttatcctaga gcgctcagga atcttgcccg ccacctgctg 25500 
tgcacttcct agcgactttg tgcccattaa gtaccgcgaa tgccctccgc cgctttgggg 25560 
ccactgctac cttctgcagc tagccaacta ccttgcctac cactctgaca taatggaaga 25620 
cgtgagcggt gacggtctac tggagtgtca ctgtcgctgc aacctatgca ccccgcaccg 25680 
ctccctggtt tgcaattcgc agctgcttaa cgaaagtcaa attatcggta cctttgagct 25740 
gcagggtccc tcgcctgacg aaaagtccgc ggctccgggg ttgaaactca ctccggggct 25800 
gtggacgtcg gcttaccttc gcaaatttgt acctgaggac taccacgccc acgagattag 25860 
gttctacgaa gaccaatccc gcccgccaaa tgcggagctt accgcctgcg tcattaccca 25920 
gggccacatt cttggccaat tgcaagccat caacaaagcc cgccaagagt ttctgctacg 25980 
aaagggacgg ggggtttact tggaccccca gtccggcgag gagctcaacc caatcccccc 26040 
gccgccgcag ccctatcagc agcagccgcg ggcccttgct tcccaggatg gcacccaaaa 26100 
agaagctgca gctgccgccg ccacccacgg acgaggagga atactgggac agtcaggcag 26160 
aggaggtttt ggacgaggag gaggaggaca tgatggaaga ctgggagagc ctagacgagg 26220 
aagcttccga ggtcgaagag gtgtcagacg aaacaccgtc accctcggtc gcattcccct 26280 
cgccggcgcc ccagaaatcg gcaaccggtt ccagcatggc tacaacctcc gctcctcagg 26340 
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cgccgccggc actgcccgtt cgccgaccca 
ccggtaagtc caagcagccg ccgccgttag 
gctcatggcg cgggcacaag aacgccatag 
tctccttcgc ccgccgcttt cttctctacc 
tgcattacta ccgtcatctc tacagcccat 
acagcagcgg ccacacagaa gcaaaggcga 
aaatccacag cggcggcagc agcaggagga 
gtatcgaccc gcgagcttag aaacaggatt 
agcaggggcc aagaacaaga gctgaaaata 
agctgcctgt atcacaaaag cgaagatcag 
ctcttcagta aatactgcgc gctgactctt 
taagcgcgaa aactacgtca tctccagcgg 
gccattatga gcaaggaaat tcccacgccc 
cttgcggctg gagctgccca agactactca 
cacatgatat cccgggtcaa cggaatccgc 
gcggctatta ccaccacacc tcgtaataac 
gtgtaccagg aaagtcccgc tcccaccact 
gttcagatga ctaactcagg ggcgcagctt 
cccgggcagg gtataactca cctgacaatc 
tcggtgagct cctcgcttgg tctccgtccg 
cgtccttcat tcacgcctcg tcaggcaatc 
cgctctggag gcattggaac tctgcaattt 
aaccccttct cgggacctcc cggccactat 
gtaaaggact cggcggacgg ctacgactga 
ctgaaacacc tggtccactg tcgccgccac 
tgctactttg aattgcccga ggatcatatc 
tataataaat acagaaatta aaatatactg 
cgtcttcacc cgcccaagca aaccaaggcg 
ctctgtgatt tacaacagtt tcaacccaga 
gctcagctac tccatcagaa aaaacaccac 
ttaaaagtca ggcttcctgg atgtcagcat 
ttgttccagt ccaactacag cgacccaccc 
ccgccgctac cggacttaca tctaccacaa 
actgggataa cttgggcatg tggtggttct 
ttatgtggct catctgctgc ctaaagcgca 
tcattgtgct acacccaaac aatgatggaa 
tcttttctct tacagtatga ttaaatgaga 
agcagcacct ccttgccctc ctcccagctc 
tttctccaca atctaaatgg aatgtcagtt 
atcttcatgt tgttgcagat gaagcgcgca 
tatccatatg acacggaaac cggtcctcca 
tcccccaatg ggtttcaaga gagtccccct 
ctagttacct ccaatggcat gcttgcgctc 
gccggcaacc ttacctccca aaatgtaacc 
tcaaacataa acctggaaat atctgcaccc 
gctgccgccg cacctctaat ggtcgcgggc 
ctaaccgtgc acgactccaa acttagcatt 
ggaaagctag ccctgcaaac atcaggcccc 
atcactgcct caccccctct aactactgcc 
cccatttata cacaaaatgg aaaactagga 
gacgacctaa acactttgac cgtagcaact 
ttgcaaacta aagttactgg agccttgggt 
gtagcaggag gactaaggat tgattctcaa 
ccgtttgatg ctcaaaacca actaaatcta 
tcagcccaca acttggatat taactacaac 
aattccaaaa agcttgaggt taacctaagc 
gccatagcca ttaatgcagg agatgggctt 
aatcccctca aaacaaaaat tggccatggc 
cctaaactag gaactggcct tagttttgac 
aataatgata agctaacttt gtggaccaca 
gcagagaaag atgctaaact cactttggtc 
acagtttcag ttttggctgt taaaggcagt 
gctcatctta ttataagatt tgacgaaaat 
ccagaatatt ggaactttag aaatggagat 
gttggattta tgcctaacct atcagcttat 
aacattgtca gtcaagttta cttaaacgga 



accgtagatg ggacaccact ggaaccaggg 26400 
cccaagagca acaacagcgc caaggctacc 26460 
ttgcttgctt gcaagactgt gggggcaaca 26520 
atcacggcgt ggccttcccc cgtaacatcc 26580 
actgcaccgg cggcagcggc agcggcagca 26640 
ccggatagca agactctgac aaagcccaag 26700 
ggagcgctgc gtctggcgcc caacgaaccc 26760 
tttcccactc tgtatgctat atttcaacag 26820 
aaaaacaggt ctctgcgatc cctcacccgc 26880 
cttcggcgca cgctggaaga cgcggaggct 26940 
aaggactagt ttcgcgccct ttctcaaatt 27000 
ccacacccgg cgccagcacc tgtcgtcagc 27060 
tacatgtgga gttaccagcc acaaatggga 27120 
acccgaataa actacatgag cgcgggaccc 27180 
gcccaccgaa accgaattct cttggaacag 27240 
cttaatcccc gtagttggcc cgctgccctg 27300 
gtggtacttc ccagagacgc ccaggccgaa 27360 
gcgggcggct ttcgtcacag ggtgcggtcg 27420 
agagggcgag gtattcagct caacgacgag 27480 
gacgggacat ttcagatcgg cggcgccggc 27540 
ctaactctgc agacctcgtc ctctgagccg 27 600 
attgaggagt ttgtgccatc ggtctacttt 27660 
ccggatcaat ttattcctaa ctttgacgcg 27720 
taattaagtg gagaggcaga gcaactgcgc 27780 
aagtgctttg cccgcgactc cggtgagttt 27840 
gaggatcttt gttgccatct ctgtgctgag 27900 
gggctcctat cgccatcctg taaacgccac 27960 
aaccttacct ggtactttta acatctctcc 28020 
cggagtgagt ctacgagaga acctctccga 28080 
cctccttacc tgccgggaac gtacccttaa 28140 
ctgactttgg ccagcacctg tcccgcggat 28200 
taacagagat gaccaacaca accaacgcgg 28260 
atacacccca agtttctgcc tttgtcaata 28320 
ccatagcgct tatgtttgta tgccttatta 28380 
aacgcgcccg accacccatc tatagtccca 284 40 
tccatagatt ggacggactg aaacacatgt 28500 
ttaattaagg aatttctgtc cagtttattc 28560 
tggtattgca gcttcctcct ggctgcaaac 28620 
tcctcctgtt cctgtccatc cgcacccact 26680 
agaccgtctg aagatacctt caaccccgtg 28740 
actgtgcctt ttcttactcc tccctttgta 28800 
ggggtactct ctttgcgcct atccgaacct 28860 
aaaatgggca acggcctctc tctggacgag 28920 
actgtgagcc cacctctcaa aaaaaccaag 28980 
ctcacagtta cctcagaagc cctaactgtg 29040 
aacacactca ccatgcaatc acaggccccg 29100 
gccacccaag gacccctcac agtgtcagaa 29160 
ctcaccacjca ccgatagcag tacccttact 29220 
actggtagct tgggcattga cttgaaagag 29280 
ctaaagtacg gggctccttt gcatgtaaca 29340 
ggtccaggtg tgactattaa taatacttcc 29400 
tttgattcac aaggcaatat gcaacttaat 29460 
aacagacgcc ttatacttga tgttagttat 29520 
agactaggac agggccctct ttttataaac 29580 
aaaggccttt acttgtttac agcttcaaac 29640 
actgccaagg ggttgatgtt tgacgctaca 29700 
gaatttggtt cacctaatgc accaaacaca 29760 
ctagaatttg attcaaacaa ggctatggtt 29820 
agcacaggtg ccattacagt aggaaacaaa 29880 
ccagctccat ctcctaactg tagactaaat 29940 
ttaacaaaat gtggcagtca aatacttgct 30000 
ttggctccaa tatctggaac agttcaaagt 30060 
ggagtgctac taaacaattc cttcctggac 30120 
cttactgaag gcacagccta tacaaacgct 30180 
ccaaaatctc acggtaaaac tgccaaaagt 30240 
gacaaaacta aacctgtaac actaaccatt 30300 
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acactaaacg gtacacagga aacaggagac acaactccaa gtgcatactc tatgtcattt 30360 
tcatgggact ggtctggcca caactacatt aatgaaatat ttgccacatc ctcttacact 30420 
ttttcataca ttgcccaaga ataaagaatc gtttgtgtta tgtttcaacg tgtttatttt 30480 
tcaattgcag aaaatttcaa gtcatttttc attcagtagt atagccccac caccacatag 30540 
cttatacaga tcaccgtacc ttaatcaaac tcacagaacc ctagtattca acctgccacc 30600 
tccctcccaa cacacagagt acacagtcct ttctccccgg ctggccttaa aaagcatcat 30660 
atcatgggta acagacatat tcttaggtgt tatattccac acggtttcct gtcgagccaa 30720 
acgctcatca gtgatattaa taaactcccc gggcagctca cttaagttca tgtcgctgtc 30780 
cagctgctga gccacaggct gctgtccaac ttgcggttgc ttaacgggcg gcgaaggaga 30840 
agtccacgcc tacatggggg tagagtcata atcgtgcatc aggatagggc ggtggtgctg 30900 
cagcagcgcg cgaataaact gctgccgccg ccgctccgtc ctgcaggaat acaacatggc 30960 
agtggtctcc tcagcgatga ttcgcaccgc ccgcagcata aggcgccttg tcctccgggc 31020 
acagcagcgc accctgatct cacttaaatc agcacagtaa ctgcagcaca gcaccacaat 31080 
attgttcaaa atcccacagt gcaaggcgct gtatccaaag ctcatggcgg ggaccacaga 31140 
acccacgtgg ccatcatacc acaagcgcag gtagattaag tggcgacccc tcataaacac 31200 
gctggacata aacattacct cttttggcat gttgtaattc accacctccc ggtaccatat 31260 
aaacctctga ttaaacatgg cgccatccac caccatccta aaccagctgg ccaaaacctg 31320 
cccgccggct atacactgca gggaaccggg actggaacaa tgacagtgga gagcccagga 31380 
ctcgtaacca tggatcatca tgctcgtcat gatatcaatg ttggcacaac acaggcacac 31440 
gtgcatacac ttcctcagga ttacaagctc ctcccgcgtt agaaccatat cccagggaac 31500 
aacccattcc tgaatcagcg taaatcccac actgcaggga agacctcgca cgtaactcac 31560 
gttgtgcatt gtcaaagtgt tacattcggg cagcagcgga tgatcctcca gtatggtagc 31620 
gcgggtttct gtctcaaaag gaggtagacg atccctactg tacggagtgc gccgagacaa 31680 
ccgagatcgt gttggtcgta gtgtcatgcc aaatggaacg ccggacgtag tcatatttcc 31740 
tgaagcaaaa ccaggtgcgg gcgtgacaaa cagatctgcg tctccggtct cgccgcttag 31800 
atcgctctgt gtagtagttg tagtatatcc actctctcaa agcatccagg cgccccctgg 31860 
cttcgggttc tatgtaaact ccttcatgcg ccgctgccct gataacatcc accaccgcag 31920 
aataagccac acccagccaa cctacacatt cgttctgcga gtcacacacg ggaggagcgg 31980 
gaagagctgg aagaaccatg tttttttttt tattccaaaa gattatccaa aacctcaaaa 32040 
tgaagatcta ttaagtgaac gcgctcccct ccggtggcgt ggtcaaactc tacagccaaa 32100 
gaacagataa tggcatttgt aagatgttgc acaatggctt ccaaaaggca aacggccctc 32160 
acgtccaagt ggacgtaaag gctaaaccct tcagggtgaa tctcctctat aaacattcca 32220 
gcaccttcaa ccatgcccaa ataattctca tctcgccacc ttctcaatat atctctaagc 32280 
aaatcccgaa tattaagtcc ggccattgta aaaatctgct ccagagcgcc ctccaccttc 32340 
agcctcaagc agcgaatcat gattgcaaaa attcaggttc ctcacagacc tgtataagat 32400 
tcaaaagcgg aacattaaca aaaataccgc gatcccgtag gtcccttcgc agggccagct 32460 
gaacataatc gtgcaggtct gcacggacca gcgcggccac ttccccgcca ggaaccttga 32520 
caaaagaacc cacactgatt atgacacgca tactcggagc tatgctaacc agcgtagccc 32580 
cgatgtaagc tttgttgcat . gggcggcgat ataaaatgca aggtgctgct caaaaaatca 32640 
ggcaaagcct cgcgcaaaaa agaaagcaca tcgtagtcat gctcatgcag ataaaggcag 32700 
gtaagctccg gaaccaccac agaaaaagac accatttttc tctcaaacat gtctgcgggt 32760 
ttctgcataa acacaaaata aaataacaaa aaaacattta aacattagaa gcctgtctta 32820 
caacaggaaa aacaaccctt ataagcataa gacggactac ggccatgccg gcgtgaccgt 32880 
aaaaaaactg gtcaccgtga ttaaaaagca ccaccgacag ctcctcggtc atgtccggag 32940 
tcataatgta agactcggta aacacatcag gttgattcat cggtcagtgc taaaaagcga 33000 
ccgaaatagc ccgggggaat acatacccgc aggcgtagag acaacattac agcccccata 33060 
ggaggtataa caaaattaat aggagagaaa aacacataaa cacctgaaaa accctcctgc 33120 
ctaggcaaaa tagcaccctc ccgctccaga acaacataca gcgcttcaca gcggcagcct 33180 
aacagtcagc cttaccagta aaaaagaaaa cctattaaaa aaacaccact cgacacggca 33240 
ccagctcaat cagtcacagt gtaaaaaagg gccaagtgcg ttacactgca gcaggtgtga 33300 
ctcagccatg gcacctctgc agcctgggta ccctgcttgg ggcatggccc cttatagctg 33360 
ggcggggcgt gggggctctg taggagtggc agcgacctca gtgtttgtct ttgctctgaa 33420 
gagccctcca ggtgcttgat cccacctttt cccagcagga acactcctgc ctgccttacc 33480 
acctgtcctg gctgatggcc tgttcctgcc tcctttgccc cctgcccaga ctcccatgtt 33540 
cctggacttg tggcttcctc caaccagggg ctctcaagcc tccatacctg gtcccacctc 33600 
tccaggccgt gggagggagg ttgaggaggg tggagggcat ctggttgggg gcagcctggg 33660 
tgttcccctc ccatcccctc cctgggcctc ccaggccccc tctactcttg agcaatgctc 33720 
ttgagagctt cctgcctggc tcttaaccca gggcaagccc tggaagggca gacccaggac 33780 
actctcacca cctccttacc ttttcccctg gaaaaatctt ctgtatactt cccattttaa 33840 
gaaaactaca attcccaaca catacaagtt actccgccct aaaacctacg tcacccgccc 33900 
cgttcccacg ccccgcgcca cgtcacaaac tccaccccct cattatcata ttggcttcaa 33960 
tccaaaataa ggtatattat tgatgatg 33988 

<210> 15 
<211> 34737 
<212> DNA 
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<213> Adenovirus subgroup C 
<400> 15 

catcatcaat aatatacctt attttggatt 

ttgtgacgtg gcgcggggcg tgggaacggg 

gatgttgcaa gtgtggcgga acacatgtaa 

gtgtgcgccg gtgtacacag gaagtgacaa 

taaatttggg cgtaaccgag taagatttgg 

agtgaaatct gaataatttt gtgttactca 

gactttgacc gtttacgtgg agactcgccc 

cgggtcaaag ttggcgtttt attattatag 

tgagttcctc aagaggccac tcttgagtgc 

tccgacaccg ggactgaaaa tgagacatga 

ccattttgaa ccacctaccc ttcacgaact 

tcccaacgag gaggcggttt cgcagatttt 

agggattgac ttactcactt ttccgccggc 

ccggcagccc gagcagccgg agcagagagc 

tccacccagt gacgacgagg atgaagaggg 

ccccgggcac ggttgcaggt cttgtcatta 

tatgtgttcg ctttgctata tgaggacctg 

atgggcagtg ggtgatagag tggtgggttt 

gttttgtggt ttaaagaatt ttgtattgtg 

gagcctgagc ccgagccaga accggagcct 

cctgctatcc tgagacgccc gacatcacct 

agctgtgact ccggtccttc taacacacct 

cccattaaac cagttgccgt gagagttggt 

gacttgctta acgagcctgg gcaacctttg 

ggtgtaaacc tgtgattgcg tgtgtggtta 

agtttaataa agggtgagat aatgtttaac 

aaagggtata taatgcgccg tgggctaatc 

gagtgtttgg aagatttttc tgctgtgcgt 

tcttggtttt ggaggtttct gtggggctca 

gaggattaca agtgggaatt tgaagagctt 

ttgaatctgg gtcaccaggc gcttttccaa 

acaccggggc gcgctgcggc tgctgttgct 

gaagaaaccc atctgagcgg ggggtacctg 

gcggttgtga gacacaagaa tcgcctgcta 

ccgacggagg agcagcagca gcagcaggag 

ccatggaacc cgagagccgg cctggaccct 

tgtatccaga actgagacgc attttgacaa 

taaagaggga gcggggggct tgtgaggcta 

taatgaccag acaccgtcct gagtgtatta 

atgagcttga tctgctggcg cagaagtatt 

agccagggga tgattttgag gaggctatta 

attgcaagta caagatcagc aaacttgtaa 

acggggccga ggtggagata gatacggagg 

atatgtggcc gggggtgctt ggcatggacg 

gccccaattt tagcggtacg gttttcctgg 

gcttctatgg gtttaacaat acctgtgtgg 

gtgcctttta ctgctgctgg aagggggtgg 

agaaatgcct ctttgaaagg tgtaccttgg 

gccacaatgt ggcctccgac tgtggttgct 

agcataacat ggtatgtggc aactgcgagg 

acggcaactg tcacctgctg aagaccattc 

cagtgtttga gcataacata ctgacccgct 

tgttcctacc ttaccaatgc aatttgagtc 

tgtccaaggt gaacctgaac ggggtgtttg 

ggtacgatga gacccgcacc aggtgcagac 

accagcctgt gatgctggat gtgaccgagg 

gcacccgcgc tgagtttggc tctagcgatg 

ggcgtggctt aagggtggga aagaatatat 

gttttgcagc agccgccgcc gccatgagca 

catatttgac aacgcgcatg cccccatggg 

gcattgatgg tcgccccgtc ctgcccgcaa 

ctggaacgcc gttggagact gcagcctccg 

gcgggattgt gactgacttt gctttcctga 



gaagccaata tgataatgag ggggtggagt 60 
gcgggtgacg tagtagtgtg gcggaagtgt 120 
gcgacggatg tggcaaaagt gacgtttttg 180 
ttttcgcgcg gttttaggcg gatgttgtag 240 
ccattttcgc gggaaaactg aataagagga 300 
tagcgcgtaa tatttgtcta gggccgcggg 360 
aggtgttttt ctcaggtgtt ttccgcgttc 420 
tcagctgacg tgtagtgtat ttatacccgg 480 
cagcgagtag agttttctcc tccgagccgc 540 
ggtactggct gataatcttc. cacctcctag 600 
gtatgattta gacgtgacgg cccccgaaga 660 
tcccgactct gtaatgttgg cggtgcagga 720 
gcccggttct ccggagccgc ctcacctttc 780 
cttgggtccg gtttgccacg aggctggctt 840 
tgaggagttt gtgttagatt atgtggagca 900 
tcaccggagg aatacggggg acccagatat 960 
tggcatgttt gtctacagta agtgaaaatt 1020 
ggtgtggtaa tttttttttt aatttttaca 1080 
atttttttaa aaggtcctgt gtctgaacct 1140 
gcaagaccta cccgccgtcc taaaatggcg 1200 
gtgtctagag aatgcaatag tagtacggat 1260 
cctgagatac acccggtggt cccgctgtgc 1320 
gggcgtcgcc aggctgtgga atgtatcgag 1380 
gacttgagct gtaaacgccc caggccataa 1440 
acgcctttgt ttgctgaatg agttgatgta 1500 
ttgcatggcg tgttaaatgg ggcggggctt 1560 
ttggttacat ctgacctcat ggaggcttgg 1620 
aacttgctgg aacagagctc taacagtacc 1680 
tcccaggcaa agttagtctg cagaattaag 1740 
ttgaaatcct gtggtgagct gtttgattct 1800 
gagaaggtca tcaagacttt ggatttttcc 1860 
tttttgagtt ttataaagga taaatggagc 1920 
ctggattttc tggccatgca tctgtggaga 1980 
ctgttgtctt ccgtccgccc ggcgataata 2040 
gaagccaggc ggcggcgtfca ggagcagagc 2100 
cgggaatgaa tgttgtacag gtggctgaac 2160 
ttacagagga tgggcagggg ctaaaggggg 2220 
cagaggaggc taggaatcta gcttttagct 2280 
cttttcaaca gatcaaggat aattgcgcta 2340 
ccatagagca gctgaccact tactggctgc 2400 
gggtatatgc aaaggtggca cttaggccag 24 60 
atatcaggaa ttgttgctac atttctggga 2520 
atagggtggc ctttagatgt agcatgataa 2580 
gggtggttat tatgaatgta aggtttactg 2640 
ccaataccaa ccttatccta cacggtgtaa 2700 
aagcctggac cgatgtaagg gttcggggct 2760 
tgtgtcgccc caaaagcagg gcttcaatta 2820 
gtatcctgtc tgagggtaac tccagggtgc 2880 
tcatgctagt gaaaagcgtg gctgtgatta 2940 
acagggcctc tcagatgctg acctgctcgg 3000 
acgtagccag ccactctcgc aaggcctggc 3060 
gttccttgca tttgggtaac aggagggggg 3120 
acactaagat attgcttgag cccgagagca 3180 
acatgaccat gaagatctgg aaggtgctga 3240 
cctgcgagtg tggcggtaaa catattagga 3300 
agctgaggcc cgatcacttg gtgctggcct 3360 
aagatacaga ttgaggtact gaaatgtgtg 3420 
aaggtggggg tcttatgtag ttttgtatct 3480 
ccaactcgtt tgatggaagc attgtgagct 3540 
ccggggtgcg tcagaatgtg atgggct.cca 3600 
actctactac cttgacctac gagaccgtgt 3660 
ccgccgcttc agccgctgca gccaccgccc 3720 
gcccgcttgc aagcagtgca gcttcccgtt 3780 
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catccgcccg cgatgacaag ttgacggctc 
aacttaatgt cgtttctcag cagctgttgg 
cttcctcccc tcccaatgcg gtttaaaaca 
ggatcaagca agtgtcttgc tgtctttatt 
accagcggtc tcggtcgttg agggtcctgt 
tctggatgtt cagatacatg ggcataagcc 
gagcttcatg ctgcggggtg gtgttgtaga 
ggtgcctaaa aatgtctttc agtagcaagc 
tgtttacaaa gcggttaagc tgggatgggt 
actgtatttt taggttggct atgttcccag 
gaaccaccag cacagtgtat ccggtgcact 
atgcgtggaa gaacttggag acgcccttgt 
taatgatggc aatgggccca cgggcggcgg 
cgtcatagtt gtgttccagg atgagatcgt 
gggtgccaga ctgcggtata atggttccat 
tttgcatttc ccacgctttg agttcagatg 
agaaaacggt ttccggggta ggggagatca 
gcgacttacc gcagccggtg ggcccgtaaa 
taagagagct gcagctgccg tcatccctga 
tgactcgcat gttttccctg accaaatccg 
gttcttgcaa ggaagcaaag tttttcaacg 
tgagcgtttg accaagcagt tccaggcggt 
ctcgatccag catatctcct cgtttcgcgg 
tcggtgctcg tccagacggg ccagggtcat 
cgtagtctgg gtcacggtga aggggtgcgc 
gaggctggtc ctgctggtgc tgaagcgctg 
gcatttgacc atggtgtcat agtccagccc 
gcccttggag gaggcgccgc acgaggggca 
cgcgagaaat accgattccg gggagtaggc 
gcattccacg agccaggtga gctctggccg 
ctttttgatg cgtttcttac ctctggtttc 
aaggctgtcc gtgtccccgt atacagactt 
gtcctcctcg tatagaaact cggaccactc 
gaaggaggct aagtgggagg ggtagcggtc 
ggtgtgaaga cacatgtcgc. cctcttcggc 
ggccacgtga ccgggtgttc ctgaaggggg 
ctcactctct tccgcatcgc tgtctgcgag 
aaaagcgggc atgacttctg cgctaagatt 
attcacctgg cccgcggtga tgcctttgag 
aatctttttg ttgtcaagct tggtggcaaa 
ggcgatggag cgcagggttt ggtttttgtc 
tagctgcacg tattcgcgcg caacgcaccg 
gggcaccagg tgcacgcgcc aaccgcggtt 
tacctctccg cgtaggcgct cgttggtcca 
tggcggtagg gggtctagct gcgtctcgtc 
gggcagcagg cgcgcgtcga agtagtctat 
ccatgcgcgg gcggcaagcg cgcgctcgta 
gtgggtgagc gcggaggcgt acatgccgca 
tattccaaga tatgtagggt agcatcttcc 
tagttcgtgc gagggagcga ggaggtcggg 
tcggaagact atctgcctga agatggcatg 
gacgttgaag ctggcgtctg tgagacctac 
gcgcagcttg ttgaccagct cggcggtgac 
ttccttgatg atgtcatact tatcctgtcc 
aaactcttcg cggtctttcc agtactcttg 
agagcctagc atgtagaact ggttgacggc 
tagcgcgtat gcctgcgcgg ccttccggag 
gaccatgact ttgaggtact ggtatttgaa 
gagcaaaaag tccgtgcgct ttttggaacg 
gaagagtatc tttcccgcgc gaggcataaa 
ctcggaacgg ttgttaatta cctgggcggc 
gtggcccaca atgtaaagtt ccaagaagcg 
aagttcctcg taggtgagct cttcagggga 
tgcaagatga gggttggaag cgacgaatga 
caggtggtcg cgaaaggtcc taaactggcg 
gtagaaggta agcgggtctt gttcccagcg 



ttttggcaca attggattct ttgacccggg 3840 
atctgcgcca gcaggtttct gccctgaagg 3900 
taaataaaaa accagactct gtttggattt 3960 
taggggtttt gcgcgcgcgg taggcccggg 4020 
gtattttttc caggacgtgg taaaggtgac 4080 
cgtctctggg gtggaggtag caccactgca 4140 
tgatccagtc gtagcaggag cgctgggcgt 4200 
tgattgccag gggcaggccc ttggtgtaag 4260 
gcatacgtgg ggatatgaga tgcatcttgg 4320 
ccatatccct ccggggattc atgttgtgca 4380 
tgggaaattt gtcatgtagc ttagaaggaa 44 4 0 
gacctccaag attttccatg cattcgtcca 4500 
cctgggcgaa gatatttctg ggatcactaa 4560 
cataggccat ttttacaaag cgcgggcgga 4620 
ccggcccagg ggcgtagtta ccctcacaga 4680 
gggggatcat gtctacctgc ggggcgatga 4740 
gctgggaaga aagcaggttc ctgagcagct 4800 
tcacacctat taccgggtgc aactggtagt 4860 
gcaggggggc cacttcgtta agcatgtccc 4 920 
ccagaaggcg ctcgccgccc agcgatagca 4 980 
gtttgagacc gtccgccgta ggcatgcttt 5040 
cccacagctc ggtcacctgc tctacggcat 5100 
gttggggcgg ctttcgctgt acggcagtag 5160 
gtctttccac gggcgcaggg tcctcgtcag 5220 
tccgggctgc gcgctggcca gggtgcgctt 5280 
ccggtcttcg ccctgcgcgt cggccaggta 5340 
ctccgcggcg tggcccttgg cgcgcagctt 5400 
gtgcagactt ttgagggcgt agagcttggg 5460 
atccgcgccg caggccccgc agacggtctc 5520 
ttcggggtca aaaaccaggt ttcccccatg 5580 
catgagccgg tgtccacgct cggtgacgaa 5640 
gagaggcctg tcctcgagcg gtgttccgcg 5700 
tgagacaaag gctcgcgtcc aggccagcac 5760 
gttgtccact agggggtcca ctcgctccag 5820 
atcaaggaag gtgattggtt tgtaggtgta 5880 
gctataaaag ggggtggggg cgcgttcgtc 5940 
ggccagctgt tggggtgagt actccctctg 6000 
gtcagtttcc aaaaacgagg aggatttgat 6060 
ggtggccgca tccatctggt cagaaaagac 6120 
cgacccgtag agggcgttgg acagcaactt 6180 
gcgatcggcg cgctccttgg ccgcgatgtt 6240 
ccattcggga aagacggtgg tgcgctcgtc 6300 
gtgcagggtg acaaggtcaa cgctggtggc 6360 
gcagaggcgg ccgcccttgc gcgagcagaa 6420 
cggggggtct gcgtccacgg taaagacccc 6480 
cttgcatcct tgcaagtcta gcgcctgctg 6540 
tgggttgagt gggggacccc atggcatggg 6600 
aatgtcgtaa acgtagaggg gctctctgag 6660 
accgcggatg ctggcgcgca cgtaatcgta 6720 
accgaggttg ctacgggcgg gctgctctgc 6780 
tgagttggat gatatggttg gacgctggaa 6840 
cgcgtcacgc acgaaggagg cgtaggagtc 6900 
ctgcacgtct agggcgcagt agtccagggt 6960 
cttttttttc cacagctcgc ggttgaggac 7020 
gatcggaaac ccgtcggcct ccgaacggta 7080 
ctggtaggcg cagcatccct tttctacggg 7140 
cgaggtgtgg gtgagcgcaa aggtgtccct 7200 
gtcagtgtcg tcgcatccgc cctgctccca 7260 
cggatttggc agggcgaagg tgacatcgtt 7320 
gttgcgtgtg atgcggaagg gtcccggcac 7380 
gagcacgatc tcgtcaaagc cgttgatgtt 7440 
cgggatgccc ttgatggaag gcaatttttt 7500 
gctgagcccg tgctctgaaa gggcccagtc 7560 
gctccacagg tcacgggcca ttagcatttg 7620 
acctatggcc attttttctg gggtgatgca 7680 
gtcccatcca aggttcgcgg ctaggtctcg 7740 
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cgcggcagtc actagaggct catctccgcc gaacttcatg accagcatga agggcacgag 7800 
ctgcttccca aaggccccca tccaagtata ggtctctaca tcgtaggtga caaagagacg 7860 
ctcggtgcga ggatgcgagc cgatcgggaa gaactggatc tcccgccacc aattggagga 7920 
gtggctattg atgtggtgaa agtagaagtc cctgcgacgg gccgaacact cgtgctggct 7980 
tttgtaaaaa cgtgcgcagt actggcagcg gtgcacgggc tgtacatcct gcacgaggtt 8040 
gacctgacga ccgcgcacaa ggaagcagag tgggaatttg agcccctcgc ctggcgggtt 8100 
tggctggtgg tcttctactt cggctgcttg tccttgaccg tctggctgct cgaggggagt 8160 
tacggtggat cggaccacca cgccgcgcga gcccaaagtc cagatgtccg cgcgcggcgg 8220 
tcggagcttg atgacaacat cgcgcagatg ggagctgtcc atggtctgga gctcccgcgg 8280 
cgtcaggtca ggcgggagct cctgcaggtt tacctcgcat agacgggtca gggcgcgggc 8340 
tagatccagg tgatacctaa tttccagggg ctggttggtg gcggcgtcga tggcttgcaa 8400 
gaggccgcat ccccgcggcg cgactacggt accgcgcggc gggcggtggg ccgcgggggt 84 60 
gtccttggat gatgcatcta aaagcggtga cgcgggcgag cccccggagg tagggggggc 8520 
tccggacccg ccgggagagg gggcaggggc acgtcggcgc cgcgcgcggg caggagctgg 8580 
tgctgcgcgc gtaggttgct ggcgaacgcg acgacgcggc ggttgatctc ctgaatctgg 8640 
cgcctctgcg tgaagacgac gggcccggtg agcttgagcc tgaaagagag ttcgacagaa 8700 
tcaatttcgg tgtcgttgac ggcggcctgg cgcaaaatct cctgcacgtc tcctgagttg 8760 
tcttgatagg cgatctcggc catgaactgc tcgatctctt cctcctggag atctccgcgt 8820 
ccggctcgct ccacggtggc ggcgaggtcg ttggaaatgc gggccatgag ctgcgagaag 8880 
gcgttgaggc ctccctcgtt ccagacgcgg ctgtagacca cgcccccttc ggcatcgcgg 8940 
gcgcgcatga ccacctgcgc gagattgagc tccacgtgcc gggcgaagac ggcgtagttt 9000 
cgcaggcgct gaaagaggta gttgagggtg gtggcggtgt gttctgccac gaagaagtac 9060 
ataacccagc gtcgcaacgt ggattcgttg atatccccca aggcctcaag gcgctccatg 9120 
gcctcgtaga agtccacggc gaagttgaaa aactgggagt tgcgcgccga cacggttaac 9180 
tcctcctcca gaagacggat gagctcggcg acagtgtcgc gcacctcgcg ctcaaaggct 9240 
acaggggcct cttcttcttc ttcaatctcc tcttccataa gggcctcccc ttcttcttct 9300 
tctggcggcg gtgggggagg ggggacacgg cggcgacgac ggcgcaccgg gaggcggtcg 9360 
acaaagcgct cgatcatctc cccgcggcga cggcgcatgg tctcggtgac ggcgcggccg 9420 
ttctcgcggg ggcgcagttg gaagacgccg cccgtcatgt cccggttatg ggttggcggg 9480 
gggctgccat gcggcaggga tacggcgcta acgatgcatc tcaacaattg ttgtgtaggt 9540 
actccgccgc cgagggacct gagcgagtcc gcatcgaccg gatcggaaaa cctctcgaga 9600 
aaggcgtcta accagtcaca gtcgcaaggt aggctgagca ccgtggcggg cggcagcggg 9660 
cggcggtcgg ggttgtttct ggcggaggtg . ctgctgatga tgtaattaaa gtaggcggtc 9720 
ttgagacggc ggatggtcga cagaagcacc atgtccttgg gtccggcctg ctgaatgcgc 9780 
aggcggtcgg ccatgcccca ggcttcgttt tgacatcggc gcaggtcttt gtagtagtct 9840 
tgcatgagcc tttctaccgg cacttcttct tctccttcct cttgtcctgc atctcttgca 9900 
tctatcgctg cggcggcggc ggagtttggc cgtaggtggc gccctcttcc tcccatgcgt 9960 
gtgaccccga agcccctcat cggctgaagc agggctaggt cggcgacaac gcgctcggct 10020 
aatatggcct gctgcacctg cgtgagggta gactggaagt catccatgtc cacaaagcgg 10080 
tggtatgcgc ccgtgttgat ggtgtaagtg cagttggcca taacggacca gttaacggtc 10140 
tggtgacccg gctgcgagag ctcggtgtac ctgagacgcg agtaagccct cgagtcaaat 10200 
acgtagtcgt tgcaagtccg caccaggtac tggtatccca ccaaaaagtg cggcggcggc 10260 
tggcggtaga ggggccagcg tagggtggcc ggggctccgg gggcgagatc ttccaacata 10320 
aggcgatgat atccgtagat gtacctggac atccaggtga tgccggcggc ggtggtggag 10380 
gcgcgcggaa agtcgcggac gcggttccag atgttgcgca gcggcaaaaa gtgctccatg 10440 
gtcgggacgc tctggccggt caggcgcgcg caatcgttga cgctctagcg tgcaaaagga 10500 
gagcctgtaa gcgggcactc ttccgtggtc tggtggataa attcgcaagg gtatcatggc 10560 
ggacgaccgg ggttcgagcc ccgtatccgg ccgtccgccg tgatccatgc ggttaccgcc 10620 
cgcgtgtcga acccaggtgt gcgacgtcag acaacggggg agtgctcctt ttggcttcct 10680 
tccaggcgcg gcggctgctg cgctagcttt tttggccact ggccgcgcgc agcgtaagcg 10740 
gttaggctgg aaagcgaaag cattaagtgg ctcgctccct gtagccggag ggttattttc 10800 
caagggttga gtcgcgggac ccccggttcg agtctcggac cggccggact gcggcgaacg 10860 
ggggtttgcc tccccgtcat gcaagacccc gcttgcaaat tcctccggaa acagggacga 10920 
gccccttttt tgcttttccc agatgcatcc ggtgctgcgg cagatgcgcc cccctcctca 10980 
gcagcggcaa gagcaagagc agcggcagac atgcagggca ccctcccctc ctcctaccgc 11040 
gtcaggaggg gcgacatccg cggttgacgc ggcagcagat ggtgattacg aacccccgcg 11100 
gcgccgggcc cggcactacc tggacttgga ggagggcgag ggcctggcgc ggctaggagc 11160 
gccctctcct gagcggtacc caagggtgca gctgaagcgt gatacgcgtg aggcgtacgt 11220 
gccgcggcag aacctgtttc gcgaccgcga gggagaggag cccgaggaga tgcgggatcg 11280 
aaagttccac gcagggcgcg agctgcggca tggcctgaat cgcgagcggt tgctgcgcga 11340 
ggaggacttt gagcccgacg cgcgaaccgg gattagtccc gcgcgcgcac acgtggcggc 11400 
cgccgacctg gtaaccgcat acgagcagac ggtgaaccag gagattaact ttcaaaaaag 11460 
ctttaacaac cacgtgcgta cgcttgtggc gcgcgaggag gtggctatag gactgatgca 11520 
tctgtgggac tttgtaagcg cgctggagca aaacccaaat agcaagccgc tcatggcgca 11580 
gctgttcctt atagtgcagc acagcaggga caacgaggca ttcagggatg cgctgctaaa 11640 
catagtagag cccgagggcc gctggctgct cgatttgata aacatcctgc agagcatagt 11700 
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ggtgcaggag 

tagcctgggc 

ggaggtaaag 

cgacctgggc 

cgagctcagc 

cggcgataga 

ccgacgcgcc 

tggcaacgtc 

cgagtactaa 

cgggcggcgc 

atggaccgca 

gccaaccggc 

gagaaggtgc 

gccggcctgg 

cagaccaacc 

gcgcagcagc 

cccgccaacg 

atggtgactg 

accagtagac 

ctgtgggggg 

aactcgcgcc 

gacacatacc 

gacgagcata 

ggcagcctgg 

ttgcacagtt 

cttaacctga 

atggaaccgg 

catcgcgcgg 

ctaccgcccc 

ctctgggacg 

caacagcgcg 

ttgtccgatc 

atagggtctc 

ctaaacaact 

aacgggatag 

agggacgtgc 

ctggtgtggg 

ggcaacccgt 

atgatgcaaa 

cccttagtat 

tggtgagcgc 

cgccgtttgt 

ctgagttggc 

atgtggcatc 

acaatgacta 

actggggcgg 

tgtttaccaa 

aggtggagct 

ccatgaccat 

agaacggggt 

ggtttgaccc 

cagacatcat 

tgttgggcat 

tggagggtgg 

atgacaccga 

aagagaactc 

ccattcgcgg 

cggccgaagc 

tgatcaaacc 

gcaccttcac 

gaatccgctc 

actggtcgtt 

gcaactttcc 

accaggccgt 

gctttcccga 

aaaacgttcc 



cgcagcttga 

aagttttacg 

atcgaggggt 

gtttatcgca 

gaccgcgagc 

gaggccgagt 

ctggaggcag 

ggcggcgtgg 

gcggtgatgt 

tgcagagcca 

tcatgtcgct 

tctccgcaat 

tggcgatcgt 

tctacgacgc 

tggaccggct 

agggcaacct 

tgccgcgggg 

agacaccgca 

aaggcctgca 

tgcgggctcc 

tgttgctgct 

taggtcactt 

ctttccagga 

aggcaaccct 

taaacagcga 

tgcgcgacgg 

gcatgtatgc 

ccgccgtgaa 

ctggtttcta 

acatagacga 

agcaggcaga 

taggcgctgc 

ttaccagcac 

cgctgctgca 

agagcctagt 

caggcccgcg 

aggacgatga 

ttgcgcacct 

ataaaaaact 

gcggcgcgcg 

ggcgccagtg 

gcctccgcgg 

acccctattc 

cctgaactac 

cagcccgggg 

cgacctgaaa 

taagtttaag 

gaaatacgag 

agaccttatg 

tctggaaagc 

cgtcactggt 

tttgctgcca 

ccgcaagcgg 

taacattccc 

acagggcggg 

caacgcggca 

cgacaccttt 

tgccgccccc 

cctgacagag 

ccagtaccgc 

atggaccctg 

gccagacatg 

ggtggtgggc 

ctactcccaa 

gaaccagatt 

tgctctcaca 



gcctggctga 
cccgcaagat 
tctacatgcg 
acgagcgcat 
tgatgcacag 
cctactttga 
ctggggccgg 
aggaatatga 
ttctgatcag 
gccgtccggc 
gactgcgcgc 
tctggaagcg 
aaacgcgctg 
gctgcttcag 
ggtgggggat 
gggctccatg 
acaggaggac 
aagtgaggtg 
gaccgtaaac 
cacaggcgac 
gctaatagcg 
gctgacactg 
gattacaagt 
aaactacctg 
ggaggagcgc 
ggtaacgccc 
ctcaaaccgg 
ccccgagtat 
caccggggga 
cagcgtgttt 
ggcggcgctg 
ggccccgcgg 
tcgcaccacc 
gccgcagcgc 
ggacaagatg 
cccgcccacc 
ctcggcagac 
tcgccccagg 
caccaaggcc 
gcgatgtatg 
gcggcggcgc 
tacctgcggc 
gacaccaccc 
cagaacgacc 
gaggcaagca 
accatcctgc 
gcgcgggtga 
tgggtggagt 
aacaacgcga 
gacatcgggg 
cttgtcatgc 
ggatgcgggg 
caacccttcc 
gcactgttgg 
ggtggcgcag 
gccgcggcaa 
gccacacggg 
gctgcgcaac 
gacagcaaga 
agctggtacc 
ctttgcactc 
atgcaagacc 
gccgagctgt 
ctcatccgcc 
ttggcgcgcc 
gatcacggga 



caaggtggcc 
ataccatacc 
catggcgctg 
ccacaaggcc 
cctgcaaagg 
cgcgggcgct 
acctgggctg 
cgaggacgat 
atgatgcaag 
cttaactcca 
aatcctgacg 
gtggtcccgg 
gccgaaaaca 
cgcgtggctc 
gtgcgcgagg 
gttgcactaa 
tacaccaact 
taccagtctg 
ctgagccagg 
cgcgcgaccg 
cccttcacgg 
taccgcgagg 
gtcagccgcg 
ctgaccaacc 
attttgcgct 
agcgtggcgc 
ccgtttatca 
ttcaccaatg 
ttcgaggtgc 
tccccgcaac 
cgaaaggaaa 
tcagatgcta 
cgcccgcgcc 
gaaaaaaacc 
agtagatgga 
cgtcgtcaaa 
gacagcagcg 
ctggggagaa 
atggcaccga 
aggaaggtcc 
tgggttctcc 
ctaccggggg 
gtgtgtacct 
acagcaactt 
cacagaccat 
ataccaacat 
tggtgtcgcg 
tcacgctgcc 
tcgtggagca 
taaagtttga 
ctggggtata 
tggacttcac 
aggagggctt 
atgtggacgc 
gcggcagcaa 
tgcagccggt 
ctgaggagaa 
ccgaggtcga 
aacgcagtta 
ttgcatacaa 
ctgacgtaac 
ccgtgacctt 
tgcccgtgca 
agtttacctc 
cgccagcccc 
cgctaccgct 



gccatcaact 

ccttacgttc 

aaggtgctta 

gtgagcgtga 

gccctggctg 

gacctgcgct 

gcggtggcac 

gagtacgagc 

acgcaacgga 

cggacgactg 

cgttccggca: 

cgcgcgcaaa 

gggccatccg 

gttacaacag 

ccgtggcgca 

acgccttcct 

ttgtgagcgc 

ggccagacta 

ctttcaaaaa 

tgtctagctt 

acagtggcag 

ccataggtca 

cgctggggca 

ggcggcagaa 

acgtgcagca 

tggacatgac 

accgcctaat 

ccatcttgaa 

ccgagggtaa 

cgcagaccct 

gcttccgcag 

gtagcccatt 

tgctgggcga 

tgcctccggc 

agacgtacgc 

ggcacgaccg 

tcctggattt 

tgttttaaaa 

gcgttggttt 

tcctccctcc 

cttcgatgct 

gagaaacagc 

ggtggacaac 

tctgaccacg 

caatcttgac 

gccaaatgtg 

cttgcctact 

cgagggcaac 

ctacttgaaa 

cacccgcaac 

tacaaacgaa 

ccacagccgc 

taggatcacc 

ctaccaggcg 

cagcagtggc 

ggaggacatg 

gcgcgctgag 

gaagcctcag 

caacctaata 

ctacggcgac 

ctgcggctcg 

ccgctccacg 

ctccaagagc 

tctgacccac 

caccatcacc 

gcgcaacagc 



attccatgct 
ccatagacaa 
ccttgagcga 
gccggcggcg 
gcacgggcag 
gggccccaag 
ccgcgcgcgc 
cagaggacgg 
cccggcggtg 
gcgccaggtc 
gcagccgcag 
ccccacgcac 
gcccgacgag 
cggcaacgtg 
gcgtgagcgc 
gagtacacag 
actgcggcta 
ttttttccag 
cttgcagggg 
gctgacgccc 
cgtgtcccgg 
ggcgcatgtg 
ggaggacacg 
gatcccctcg 
gagcgtgagc 
cgcgcgcaac 
ggactacttg 
cccgcactgg 
cgatggattc 
gctagagttg 
gccaagcagc 
tccaagcttg 
ggaggagtac 
atttcccaac 
gcaggagcac 
tcagcggggt 
gggagggagt 
aaaaaaaagc 
tcttgtattc 
tacgagagtg 
cccctggacc 
atccgttact 
aagtcaacgg 
gtcattcaaa 
gaccggtcgc 
aacgagttca 
aaggacaatc 
tactccgaga 
gtgggcagac 
ttcagactgg 
gccttccatc 
ctgagcaact 
tacgatgatc 
agcttgaaag 
agcggcgcgg 
aacgatcatg 
gccgaagcag 
aagaaaccgg 
agcaatgaca 
cctcagaccg 
gagcaggtct 
cgccagatca 
ttctacaacg 
gtgttcaatc 
accgtcagtg 
atcggaggag 



11760 
11820 
11880 
11940 
12000 
12060 
12120 
12180 
12240 
12300 
12360 
12420 
12480 
12540 
12600 
12660 
12720 
12780 
12840 
12900 
12960 
13020 
13080 
13140 
13200 
13260 
13320 
13380 
13440 
13500 
13560 
13620 
13680 
13740 
13800 
13860 
13920 
13980 
14040 
14100 
14160 
14220 
14280 
14340 
14400 
14460 
14520 
14580 
14640 
14700 
14760 
14820 
14880 
14940 
15000 
15060 
15120 
15180 
15240 
15300 
15360 
15420 
15480 
15540 
15600 
15660 
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tccagcgagt gaccattact gacgccagac 

tgggcatagt ctcgccgcgc gtcctatcga 

ttatatcgcc cagcaataac acaggctggg 

gggccaagaa gcgctccgac caacacccag 

ggggcgcgca caaacgcggc cgcactgggc 

tggtggagga ggcgcgcaac tacacgccca 

ccattcagac cgtggtgcgc ggagcccggc 

gcgtagcacg tcgccaccgc cgccgacccg 

tgcttaaccg cgcacgtcgc accggccgac 

ccgcgggtat tgtcactgtg ccccccaggt 

cggccattag tgctatgact cagggtcgca 

ttagcggcct gcgcgtgccc gtgcgcaccc 

actacttaga ctcgtactgt tgtatgtatc 

ccaagcgcaa aatcaaagaa gagatgctcc 

cgaagaagga agagcaggat tacaagcccc 

aagatgatga tgatgaactt gacgacgagg 

gacgggtaca gtggaaaggt cgacgcgtaa 

tctttacgcc cggtgagcgc tccacccgca 

gcgacgagga cctgcttgag caggccaacg 

ggcataagga catgctggcg ttgccgctgg 

ccgtaacact gcagcaggtg ctgcccgcgc 

agcgcgagtc tggtgacttg gcacccaccg 

tggaagatgt cttggaaaaa atgaccgtgg 

ggccaatcaa gcaggtggcg ccgggactgg 

ctaccagtag caccagtatt gccaccgcca 

ttgcctcagc ggtggcggat gccgcggtgc 

ctacggaggt gcaaacggac ccgtggatgt 

gttcgaggaa gtacggcgcc gccagcgcgc 

ttgcgcctac ccccggctat cgtggctaca 

gacgccgaac caccactgga acccgccgcc 

cgatttccgt gcgcagggtg gctcgcgaag 

gctaccaccc cagcatcgtt taaaagccgg 

cctgccgcct ccgtttcccg gtgccgggat 

tggccggcca cggcctgacg ggcggcatgc 

cgcaccgtcg catgcgcggc ggtatcctgc 

ttggcgccgt gcccggaatt gcatccgtgg 

caagttgcat gtggaaaaat caaaataaaa 

taactatttt gtagaatgga agacatcaac 

cgcccgttca tgggaaactg gcaagatatc 

agctggggct cgctgtggag cggcattaaa 

agcaaggcct ggaacagcag cacaggccag 

ttccaacaaa aggtggtaga tggcctggcc 

aaccaggcag tgcaaaataa gattaacagt 

cctccaccgg ccgtggagac agtgtctcca 

gacagggaag aaactctggt gacgcaaata 

aagcaaggcc tgcccaccac ccgtcccatc 

cacacacccg taacgctgga cctgcctccc 

ccaggcccga ccgccgttgt tgtaacccgt 

agcggtccgc gatcgttgcg gcccgtagcc 

atcgtgggtc tgggggtgca atccctgaag 

tcgtatgtgt gtcatgtatg cgtccatgtc 

gcccgctttc caagatggct accccttcga 

cgggccagga cgcctcggag tacctgagcc 

agacgtactt cagcctgaat aacaagttta 

tgaccacaga ccggtcccag cgtttgacgc 

ctgcgtactc gtacaaggcg cggttcaccc 

tggcttccac gtactttgac atccgcggcg 

actctggcac tgcctacaac gccctggctc 

atgaagctgc tactgctctt gaaataaacc 

aagtagacga gcaagctgag cagcaaaaaa 

gtataaatat tacaaaggag ggtattcaaa 

ccgataaaac atttcaacct gaacctcaaa 

ttaatcatgc agctgggaga gtccttaaaa 

catatgcaaa acccacaaat gaaaatggag 

gaaagctaga aagtcaagtg gaaatgcaat 

atggtgataa cttgactcct aaagtggtat 



gccgcacctg cccctacgtt tacaaggccc 15720 
gccgcacttt ttgagcaagc atgtccatcc 15780 
gcctgcgctt cccaagcaag atgtttggcg 15840 
tgcgcgtgcg cgggcactac cgcgcgccct 15900 
gcaccaccgt cgatgacgcc atcgacgcgg 15960 
cgccgccacc agtgtccaca gtggacgcgg 16020 
gctatgctaa aatgaagaga cggcggaggc 16080 
gcactgccgc ccaacgcgcg gcggcggccc 16140 
gggcggccat gcgggccgct cgaaggctgg 16200 
ccaggcgacg agcggccgcc gcagcagccg 16260 
ggggcaacgt gtattgggtg cgcgactcgg 16320 
gccccccgcg caactagatt gcaagaaaaa 16380 
cagcggcggc ggcgcgcaac gaagctatgt 1644 0 
aggtcatcgc gccggagatc tatggccccc 16500 
gaaagctaaa gcgggtcaaa aagaaaaaga 16560 
tggaactgct gcacgctacc gcgcccaggc 16620 
aacgtgtttt gcgacccggc accaccgtag 16680 
cctacaagcg cgtgtatgat gaggtgtacg 16740 
agcgcctcgg ggagtttgcc tacggaaagc 16800 
acgagggcaa cccaacacct agcctaaagc 16860 
ttgcaccgtc cgaagaaaag cgcggcctaa 16920 
tgcagctgat ggtacccaag cgccagcgac 16980 
aacctgggct ggagcccgag gtccgcgtgc 17040 
gcgtgcagac cgtggacgtt cagataccca 17100 
cagagggcat ggagacacaa acgtccccgg 17160 
aggcggtcgc tgcggccgcg tccaagacct 17220 
ttcgcgtttc agccccccgg cgcccgcgcg 17280 
tactgcccga atatgcccta catccttcca 17340 
cctaccgccc cagaagacga gcaactaccc 17400 
gccgtcgccg tcgccagccc gtgctggccc 174 60 
gaggcaggac cctggtgctg ccaacagcgc 17520 
tctttgtggt tcttgcagat atggccctca 17580 
tccgaggaag aatgcaccgt aggaggggca 17640 
gtcgtgcgca ccaccggcgg cggcgcgcgt 17700 
ccctccttat tccactgatc gccgcggcga 17760 
ccttgcaggc gcagagacac tgattaaaaa 17820 
agtctggact ctcacgctcg cttggtcctg 17880 
tttgcgtctc tggccccgcg acacggctcg 17940 
ggcaccagca atatgagcgg tggcgccttc 18000 
aatttcggtt ccaccgttaa gaactatggc 18060 
atgctgaggg ataagttgaa agagcaaaat 18120 
tctggcatta gcggggtggt ggacctggcc 18180 
aagcttgatc cccgccctcc cgtagaggag 18240 
gaggggcgtg gcgaaaagcg tccgcgcccc 18300 
gacgagcctc cctcgtacga ggaggcacta 18360 
gcgcccatgg ctaccggagt gctgggccag 18420 
cccgccgaca cccagcagaa acctgtgctg 18480 
cctagccgcg cgtccctgcg ccgcgccgcc 18540 
agtggcaact ggcaaagcac actgaacagc 18600 
cgccgacgat gcttctgaat agctaacgtg 18660 
gccgccagag gagctgctga gccgccgcgc 18720 
tgatgccgca gtggtcttac atgcacatct 18780 
ccgggctggt gcagtttgcc cgcgccaccg 18840 
gaaaccccac ggtggcgcct acgcacgacg 18900 
tgcggttcat ccctgtggac cgtgaggata 18960 
tagctgtggg tgataaccgt gtgctggaca 19020 
tgctggacag gggccctact tttaagccct 19080 
ccaagggtgc cccaaatcct tgcgaatggg 19140 
tagaagaaga ggacgatgac aacgaagacg 19200 
ctcacgtatt tgggcaggcg ccttattctg 19260 
taggtgtcga aggtcaaaca cctaaatatg 19320 
taggagaatc tcagtggtac gaaactgaaa 19380 
agactacccc aatgaaacca tgttacggtt 19440 
ggcaaggcat tcttgtaaag caacaaaatg 19500 
ttttctcaac tactgaggcg accgcaggca 19560 
tgtacagtga agatgtagat atagaaaccc 19620 
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cagacactca tatttcttac atgcccacta ttaaggaagg taactcacga gaactaatgg 19680 
gccaacaatc tatgcccaac aggcctaatt acattgcttt tagggacaat tttattggtc 19740 
taatgtatta caacagcacg ggtaatatgg gtgttctggc gggccaagca tcgcagttga 19800 
atgctgttgt agatttgcaa gacagaaaca cagagctttc ataccagctt ttgcttgatt 19860 
ccattggtga tagaaccagg tacttttcta tgtggaatca ggctgttgac agctatgatc 19920 
cagatgttag aattattgaa aatcatggaa ctgaagatga acttccaaat tactgctttc 19980 
cactgggagg tgtgattaat acagagactc ttaccaaggt aaaacctaaa acaggtcagg 20040 
aaaatggatg ggaaaaagat gctacagaat tttcagataa aaatgaaata agagttggaa 20100 
ataattttgc catggaaatc aatctaaatg ccaacctgtg gagaaatttc ctgtactcca 20160 
acatagcgct gtatttgccc gacaagctaa agtacagtcc ttccaacgta aaaatttctg 20220 
ataacccaaa cacctacgac tacatgaaca agcgagtggt ggctcccggg ttagtggact 20280 
gctacattaa ccttggagca cgctggtccc ttgactatat ggacaacgtc aacccattta 20340 
accaccaccg caatgctggc ctgcgctacc gctcaatgtt gctgggcaat ggtcgctatg 20400 
tgcccttcca catccaggtg cctcagaagt tctttgccat taaaaacctc cttctcctgc 204 60 
cgggctcata cacctacgag tggaacttca ggaaggatgt taacatggtt ctgcagagct 20520 
ccctaggaaa tgacctaagg gttgacggag ccagcattaa gtttgatagc atttgccttt 20580 
acgccacctt cttccccatg gcccacaaca ccgcctccac gcttgaggcc atgcttagaa 20640 
acgacaccaa cgaccagtcc tttaacgact atctctccgc cgccaacatg ctctacccta 20700 
tacccgccaa cgctaccaac gtgcccatat ccatcccctc ccgcaactgg gcggctttcc 20760 
gcggctgggc cttcacgcgc cttaagacta aggaaacccc atcactgggc tcgggctacg 20820 
acccttatta cacctactct ggctctatac cctacctaga tggaaccttt tacctcaacc 20880 
acacctttaa gaaggtggcc attacctttg actcttctgt cagctggcct ggcaatgacc 20940 
gcctgcttac ccccaacgag tttgaaatta agcgctcagt tgacggggag ggttacaacg 21000 
ttgcccagtg taacatgacc aaagactggt tcctggtaca aatgctagct aactacaaca 21060 
ttggctacca gggcttctat atcccagaga gctacaagga ccgcatgtac tccttcttta 21120 
gaaacttcca gcccatgagc cgtcaggtgg tggatgatac taaatacaag gactaccaac 21180 
aggtgggcat cctacaccaa cacaacaact ctggatttgt tggetacctt gcccccacca 21240 
tgcgcgaagg acaggcctac cctgctaact tcccctatcc gcttataggc aagaccgcag 21300 
ttgacagcat tacccagaaa aagtttcttt gcgatcgcac cctttggcgc atcccattct 21360 
ccagtaactt tatgtccatg ggcgcactca cagacctggg ccaaaacctt ctctacgcca 21420 
actccgccca cgcgctagac atgacttttg aggtggatcc catggacgag cccacccttc 21480 
tttatgtttt gtttgaagtc tttgacgtgg tccgtgtgca ccggccgcac cgcggcgtca 21540 
tcgaaaccgt gtacctgcgc acgcccttct cggccggcaa cgccacaaca taaagaagca 21600 
agcaacatca acaacagctg ccgccatggg ctccagtgag caggaactga aagccattgt 21660 
caaagatctt ggttgtgggc catatttttt gggcacctat gacaagcgct ttccaggctt 21720 
tgtttctcca cacaagctcg cctgcgccat agtcaatacg gccggtcgcg agactggggg 21780 
cgtacactgg atggcctttg cctggaaccc gcactcaaaa acatgctacc tctttgagcc 21840 
ctttggcttt tctgaccagc gactcaagca ggtttaccag tttgagtacg agtcactcct 21900 
gcgccgtagc gccattgctt cttcccccga ccgctgtata acgctggaaa agtccaccca 21960 
aagcgtacag gggcccaact cggccgcctg tggactattc tgctgcatgt ttctccacgc 22020 
ctttgccaac tggccccaaa ctcccatgga tcacaacccc accatgaacc ttattaccgg 22080 
ggtacccaac tccatgctca acagtcccca ggtacagccc accctgcgtc gcaaccagga 22140 
acagctctac agcttcctgg agcgccactc gccctacttc cgcagccaca gtgcgcagat 22200 
taggagcgcc acttcttttt gtcacttgaa aaacatgtaa aaataatgta ctagagacac 22260 
tttcaataaa ggcaaatgct tttatttgta cactctcggg tgattattta cccccaccct 22320 
tgccgtctgc gccgtttaaa aatcaaaggg gttctgccgc gcatcgctat gcgccactgg 22380 
cagggacacg ttgcgatact ggtgtttagt gctccactta aactcaggca caaccatccg 22440 
cggcagctcg gtgaagtttt cactccacag gctgcgcacc atcaccaacg cgtttagcag 22500 
gtcgggcgcc gatatcttga agtcgcagtt ggggcctccg ccctgcgcgc gcgagttgcg 22560 
atacacaggg ttgcagcact ggaacactat cagcgccggg tggtgcacgc tggccagcac 22620 
gctcttgtcg gagatcagat ccgcgtccag gtcctccgcg ttgctcaggg cgaacggagt 22680 
caactttggt agctgccttc ccaaaaaggg cgcgtgccca ggctttgagt tgcactcgca 22740 
ccgtagtggc atcaaaaggt gaccgtgccc ggtctgggcg ttaggataca gcgcctgcat 22800 
aaaagccttg atctgcttaa aagccacctg agcctttgcg ccttcagaga agaacatgcc 22860 
gcaagacttg ccggaaaact gattggccgg acaggccgcg tcgtgcacgc agcaccttgc 22920 
gtcggtgttg gagatctgca ccacatttcg gccccaccgg ttcttcacga tcttggcctt 22980 
gctagactgc tccttcagcg cgcgctgccc gttttcgctc gtcacatcca tttcaatcac 23040 
gtgctcctta tttatcataa tgcttccgtg tagacactta agctcgcctt cgatctcagc 23100 
gcagcggtgc agccacaacg cgcagcccgt gggctcgtga tgcttgtagg tcacctctgc 23160 
aaacgactgc aggtacgcct gcaggaatcg ccccatcatc gtcacaaagg tcttgttgct 23220 
ggtgaaggtc agctgcaacc cgcggtgctc ctcgttcagc caggtcttgc atacggccgc 23280 
cagagcttcc acttggtcag gcagtagttt gaagttcgcc tttagatcgt tatccacgtg 23340 
gtacttgtcc atcagcgcgc gcgcagcctc catgcccttc tcccacgcag acacgatcgg 23400 
cacactcagc gggttcatca ccgtaatttc actttccgct tcgctgggct cttcctcttc 23460 
ctcttgcgtc cgcataccac gcgccactgg gtcgtcttca ttcagccgcc gcactgtgcg 23520 
cttacctcct ttgccatgct tgattagcac cggtgggttg ctgaaaccca ccatttgtag 23580 
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cgccacatct tctctttctt cctcgctgtc cacgattacc tctggtgatg gcgggcgctc 23640 

gggcttggga gaagggcgct tctttttctt cttgggcgca atggccaaat ccgccgccga 23700 

ggtcgatggc cgcgggctgg gtgtgcgcgg caccagcgcg tcttgtgatg agtcttcctc 23760 

gtcctcggac tcgatacgcc gcctcatccg cttttttggg ggcgcccggg gaggcggcgg 23820 

cgacggggac ggggacgaca cgtcctccat ggttggggga cgtcgcgccg caccgcgtcc 23880 

gcgctcgggg gtggtttcgc gctgctcctc ttcccgactg gccatttcct tctcctatag 23940 

gcagaaaaag atcatggagt cagtcgagaa gaaggacagc ctaaccgccc cctctgagtt 24000 

cgccaccacc gcctccaccg atgccgccaa cgcgcctacc accttccccg tcgaggcacc 24060 

cccgcttgag gaggaggaag tgattatcga gcaggaccca ggttttgtaa gcgaagacga 24120 

cgaggaccgc tcagtaccaa cagaggataa aaagcaagac caggacaacg cagaggcaaa 24180 

cgaggaacaa gtcgggcggg gggacgaaag gcatggcgac tacctagatg tgggagacga 24240 

cgtgctgttg aagcatctgc agcgccagtg cgccattatc tgcgacgcgt tgcaagagcg 24300 

cagcgatgtg cccctcgcca tagcggatgt cagccttgcc tacgaacgcc acctattctc 24360 

accgcgcgta ccccccaaac gccaagaaaa cggcacatgc gagcccaacc cgcgcctcaa 24 420 

cttctacccc gtatttgccg tgccagaggt gcttgccacc tatcacatct ttttccaaaa 24480 

ctgcaagata cccctatcct gccgtgccaa ccgcagccga gcggacaagc agctggcctt 24540 

gcggcagggc gctgtcatac ctgatatcgc ctcgctcaac gaagtgccaa aaatctttga 24 600 

gggtcttgga cgcgacgaga agcgcgcggc aaacgctctg caacaggaaa acagcgaaaa 24 660 

tgaaagtcac tctggagtgt tggtggaact cgagggtgac aacgcgcgcc tagccgtact 24720 

aaaacgcagc atcgaggtca cccactttgc ctacccggca cttaacctac cccccaaggt 24780 

catgagcaca gtcatgagtg agctgatcgt gcgccgtgcg cagcccctgg agagggatgc 24840 

aaatttgcaa gaacaaacag aggagggcct acccgcagtt ggcgacgagc agctagcgcg 24900 

ctggcttcaa acgcgcgagc ctgccgactt ggaggagcga cgcaaactaa tgatggccgc 24 960 

agtgctcgtt accgtggagc ttgagtgcat gcagcggttc tttgctgacc cggagatgca 25020 

gcgcaagcta gaggaaacat tgcactacac ctttcgacag ggctacgtac gccaggcctg 25080 

caagatctcc aacgtggagc tctgcaacct ggtctcctac cttggaattt tgcacgaaaa 25140 

ccgccttggg caaaacgtgc ttcattccac gctcaagggc gaggcgcgcc gcgactacgt 25200 

ccgcgactgc gtttacttat ttctatgcta cacctggcag acggccatgg gcgtttggca 25260 

gcagtgcttg gaggagtgca acctcaagga gctgcagaaa ctgctaaagc aaaacttgaa 25320 

ggacctatgg acggccttca acgagcgctc cgtggccgcg cacctggcgg acatcatttt 25380 

ccccgaacgc ctgcttaaaa ccctgcaaca gggtctgcca gacttcacca gtcaaagcat 254 40 

gttgcagaac tttaggaact ttatcctaga gcgctcagga atcttgcccg ccacctgctg 25500 

tgcacttcct agcgactttg tgcccattaa gtaccgcgaa tgccctccgc cgctttgggg 25560 

ccactgctac cttctgcagc tagccaacta ccttgcctac cactctgaca taatggaaga 25620 

cgtgagcggt gacggtctac tggagtgtca ctgtcgctgc aacctatgca ccccgcaccg 25680 

ctccctggtt tgcaattcgc agctgcttaa cgaaagtcaa attatcggta cctttgagct 25740 

gcagggtccc tcgcctgacg aaaagtccgc ggctccgggg ttgaaactca ctccggggct 25800 

gtggacgtcg gcttaccttc gcaaatttgt acctgaggac taccacgccc acgagattag 25860 

gttctacgaa gaccaatccc gcccgccaaa tgcggagctt accgcctgcg tcattaccca 25920 

gggccacatt cttggccaat tgcaagccat caacaaagcc cgccaagagt ttctgctacg 25980 

aaagggacgg ggggtttact tggaccccca gtccggcgag gagctcaacc caatcccccc 26040 

gccgccgcag ccctatcagc agcagccgcg ggcccttgct tcccaggatg gcacccaaaa 26100 

agaagctgca gctgccgccg ccacccacgg acgaggagga atactgggac agtcaggcag 26160 

aggaggtttt ggacgaggag gaggaggaca tgatggaaga ctgggagagc ctagacgagg 26220 

aagcttccga ggtcgaagag gtgtcagacg aaacaccgtc accctcggtc gcattcccct 26280 

cgccggcgcc ccagaaatcg gcaaccggtt ccagcatggc tacaacctcc gctcctcagg 26340 

cgccgccggc actgcccgtt cgccgaccca accgtagatg ggacaccact ggaaccaggg 26400 

ccggtaagtc caagcagccg ccgccgttag cccaagagca acaacagcgc caaggctacc 26460 

gctcatggcg cgggcacaag aacgccatag ttgcttgctt gcaagactgt gggggcaaca 26520 

tctccttcgc ccgccgcttt cttctctacc atcacggcgt ggccttcccc cgtaacatcc 26580 

tgcattacta ccgtcatctc tacagcccat actgcaccgg cggcagcggc agcggcagca 26640 

acagcagcgg ccacacagaa gcaaaggcga ccggatagca agactctgac aaagcccaag 26700 

aaatccacag cggcggcagc agcaggagga ggagcgctgc gtctggcgcc caacgaaccc 26760 

gtatcgaccc gcgagcttag aaacaggatt tttcccactc tgtatgctat atttcaacag 26820 

agcaggggcc aagaacaaga gctgaaaata aaaaacaggt ctctgcgatc cctcacccgc 26880 

agctgcctgt atcacaaaag cgaagatcag cttcggcgca cgctggaaga cgcggaggct 26940 

ctcttcagta aatactgcgc gctgactctt aaggactagt ttcgcgccct ttctcaaatt 27000 

taagcgcgaa aactacgtca tctccagcgg ccacacccgg cgccagcacc tgtcgtcagc 27060 

gccattatga gcaaggaaat tcccacgccc tacatgtgga gttaccagcc acaaatggga 27120 

cttgcggctg gagctgccca agactactca acccgaataa actacatgag cgcgggaccc 27180 

cacatgatat cccgggtcaa cggaatccgc gcccaccgaa accgaattct cttggaacag 27240 

gcggctatta ccaccacacc tcgtaataac cttaatcccc gtagttggcc cgctgccctg 27300 

gtgtaccagg aaagtcccgc tcccaccact gtggtacttc ccagagacgc ccaggccgaa 27360 

gttcagatga ctaactcagg ggcgcagctt gcgggcggct ttcgtcacag ggtgcg[gtcg 27420 

cccgggcagg gtataactca cctgacaatc agagggcgag gtattcagct caacgacgag 27480 

tcggtgagct cctcgcttgg tctccgtccg gacgggacat ttcagatcgg cggcgccggc 27540 
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cgtccttcat tcacgcctcg tcaggcaatc 

cgctctggag gcattggaac tctgcaattt 

aaccccttct cgggacctcc cggccactat 

gtaaaggact cggcggacgg ctacgactga 

ctgaaacacc tggtccactg tcgccgccac 

tgctactttg aattgcccga ggatcatatc 

gcccagggag agcttgcccg tagcctgatt 

gagcgggaca ggggaccctg tgttctcact 

catcaagatc tttgttgcca tctctgtgct 

ctggggctcc tatcgccatc ctgtaaacgc 

gcgaacctta cctggtactt ttaacatctc 

agacggagtg agtctacgag agaacctctc 

caccctcctt acctgccggg aacgtacgag 

gcctgaccgt aaaccagact ttttccggac 

gaggtgagct tagaaaaccc ttagggtatt 

tgaacaattc aagcaactct acgggctatt 

cctggatgtc agcatctgac tttggccagc 

tacagcgacc caccctaaca gagatgacca 

ttacatctac cacaaataca ccccaagttt 

gcatgtggtg gttctccata gcgcttatgt 

gctgcctaaa gcgcaaacgc gcccgaccac 

caaacaatga tggaatccat agattggacg 

tatgattaaa tgagatctag aaatggacgg 

acgcagggca gcggccgagc aacagcgcat 

gcaccagtgc aaaaggggta tcttttgtct 

taataccacc ggacaccgcc ttagctacaa 

catggtggga gaaaagccca ttaccataac 

tcactcacct tgtcaaggac ctgaggatct 

caaagatctt attcccttta actaataaaa 

cagttagcaa atttctgtcc agtttattca 

ggtattgcag cttcctcctg gctgcaaact 

cctcctgttc ctgtccatcc gcacccacta 

gaccgtctga agataccttc aaccccgtgt 

ctgtgccttt tcttactcct ccctttgtat 

gggtactctc tttgcgccta tccgaacctc 

aaatgggcaa cggcctctct ctggacgagg 

ctgtgagccc acctctcaaa aaaaccaagt 

tcacagttac ctcagaagcc ctaactgtgg 

acacactcac catgcaatca caggccccgc 

ccacccaagg acccctcaca gtgtcagaag 

tcaccaccac cgatagcagt acccttacta 

ctggtagctt gggcattgac ttgaaagagc 

taaagtacgg ggctcctttg catgtaacag 

gtccaggtgt gactattaat aatacttcct 

ttgattcaca aggcaatatg caacttaatg 

acagacgcct tatacttgat gttagttatc 

gactaggaca gggccctctt tttataaact 

aaggccttta cttgtttaca gcttcaaaca 

ctgccaaggg gttgatgttt gacgctacag 

aatttggttc acctaatgca ccaaacacaa 

tagaatttga ttcaaacaag gctatggttc 

gcacaggtgc cattacagta ggaaacaaaa 

cagctccatc tcctaactgt agactaaatg 

taacaaaatg tggcagtcaa atacttgcta 

tggctccaat atctggaaca gttcaaagtg 

gagtgctact aaacaattcc ttcctggacc 

ttactgaagg cacagcctat acaaacgctg 

caaaatctca cggtaaaact gccaaaagta 

acaaaactaa acctgtaaca ctaaccatta 

caactccaag tgcatactct atgtcatttt 

atgaaatatt tgccacatcc tcttacactt 

tttgtgttat gtttcaacgt gtttattttt 

ttcagtagta tagccccacc accacatagc 

cacagaaccc tagtattcaa cctgccacct 

tctccccggc tggccttaaa aagcatcata 

atattccaca cggtttcctg tcgagccaaa 



ctaactctgc agacctcgtc ctctgagccg 27 600 
attgaggagt ttgtgccatc ggtctacttt 27660 
ccggatcaat ttattcctaa ctttgacgcg 27720 
atgttaagtg gagaggcaga gcaactgcgc 27780 
aagtgctttg cccgcgactc cggtgagttt 27840 
gagggcccgg cgcacggcgt ccggcttacc 27900 
cgggagttta cccagcgccc cctgctagtt 27960 
gtgatttgca actgtcctaa ccttggatta 28020 
gagtataata aatacagaaa ttaaaatata 28080 
caccgtcttc acccgcccaa gcaaaccaag 28140 
tccctctgtg atttacaaca gtttcaaccc 28200 
cgagctcagc tactccatca gaaaaaacac 28260 
tgcgtcaccg gccgctgcac cacacctacc 28320 
agacctcaat aactctgttt accagaacag 28380 
aggccaaagg cgcagctact gtggggttta 28440 
ctaattcagg tttctctaga agtcaggctt 28500 
acctgtcccg cggatttgtt ccagtccaac 28560 
acacaaccaa cgcggccgcc gctaccggac 28620 
ctgcctttgt caataactgg gataacttgg 28680 
ttgtatgcct tattattatg tggctcatct 28740 
ccatctatag tcccatcatt gtgctacacc 28800 
gactgaaaca catgttcttt tctcttacag 28860 
aattattaca gagcagcgcc tgctagaaag 28920 
gaatcaagag ctccaagaca tggttaactt 28980 
ggtaaagcag gccaaagtca cctacgacag 29040 
gttgccaacc aagcgtcaga aattggtggt 29100 
tcagcactcg gtagaaaccg aaggctgcat 29160 
ctgcaccctt attaagaccc tgtgcggtct 29220 
aaaaataata aagcatcact tacttaaaat 29280 
gcagcacctc cttgccctcc tcccagctct 29340 
ttctccacaa tctaaatgga atgtcagttt 29400 
tcttcatgtt gttgcagatg aagcgcgcaa 29460 
atccatatga cacggaaacc ggtcctccaa 29520 
cccccaatgg gtttcaagag agtccccctg 29580 
tagttacctc caatggcatg cttgcgctca 29640 
ccggcaacct tacctcccaa aatgtaacca 29700 
caaacataaa cctggaaata tctgcacccc 29760 
ctgccgccgc acctctaatg gtcgcgggca 29820 
taaccgtgca cgactccaaa cttagcattg 29880 
gaaagctagc cctgcaaaca tcaggccccc 29940 
tcactgcctc accccctcta actactgcca 30000 
ccatttatac acaaaatgga aaactaggac 30060 
acgacctaaa cactttgacc gtagcaactg 30120 
tgcaaactaa agttactgga gccttgggtt 30180 
tagcaggagg actaaggatt gattctcaaa 30240 
cgtttgatgc tcaaaaccaa ctaaatctaa 30300 
cagcccacaa cttggatatt aactacaaca 30360 
attccaaaaa gcttgaggtt aacctaagca 30420 
ccatagccat taatgcagga gatgggcttg 30480 
atcccctcaa aacaaaaatt ggccatggcc 30540 
ctaaactagg aactggcctt agttttgaca 30600 
ataatgataa gctaactttg tggaccacac 30660 
cagagaaaga tgctaaactc actttggtct 30720 
cagtttcagt tttggctgtt aaaggcagtt 30780 
ctcatcttat tataagattt gacgaaaatg 30840 
cagaatattg gaactttaga aatggagatc 30900 
ttggatttat gcctaaccta tcagcttatc 30960 
acattgtcag tcaagtttac ttaaacggag 31020 
cactaaacgg tacacaggaa acaggagaca 31080 
catgggactg gtctggccac aactacatta 31140 
tttcatacat tgcccaagaa taaagaatcg 31200 
caattgcaga aaatttcaag tcatttttca 31260 
ttatacagat caccgtacct taatcaaact 31320 
ccctcccaac acacagagta cacagtcctt 31380 
tcatgggtaa cagacatatt cttaggtgtt 31440 
cgctcatcag tgatattaat aaactccccg 31500 
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ggcagctcac ttaagttcat gtcgctgtcc agctgctgag ccacaggctg ctgtccaact 31560 
tgcggttgct taacgggcgg cgaaggagaa gtccacgcct acatgggggt agagtcataa 31620 
tcgtgcatca ggatagggcg gtggtgctgc agcagcgcgc gaataaactg ctgccgccgc 31680 
cgctccgtcc tgcaggaata caacatggca gtggtctcct cagcgatgat tcgcaccgcc 31740 
cgcagcataa ggcgccttgt cctccgggca cagcagcgca ccctgatctc acttaaatca 31800 
gcacagtaac tgcagcacag caccacaata ttgttcaaaa tcccacagtg caaggcgctg 31860 
tatccaaagc tcatggcggg gaccacagaa cccacgtggc catcatacca caagcgcagg 31920 
tagattaagt ggcgacccct cataaacacg ctggacataa acattacctc ttttggcatg 319B0 
ttgtaattca ccacctcccg gtaccatata aacctctgat taaacatggc gccatccacc 32040 
accatcctaa accagctggc caaaacctgc ccgccggcta tacactgcag ggaaccggga 32100 
ctggaacaat gacagtggag agcccaggac tcgtaaccat ggatcatcat gctcgtcatg 32160 
atatcaatgt tggcacaaca caggcacacg tgcatacact tcctcaggat tacaagctcc 32220 
tcccgcgtta gaaccatatc ccagggaaca acccattcct gaatcagcgt aaatcccaca 32280 
ctgcagggaa gacctcgcac gtaactcacg ttgtgcattg tcaaagtgtt acattcgggc 32340 
agcagcggat gatcctccag tatggtagcg cgggtttctg tctcaaaagg aggtagacga 32400 
tccctactgt acggagtgcg ccgagacaac cgagatcgtg ttggtcgtag tgtcatgcca 324 60 
aatggaacgc cggacgtagt catatttcct gaagcaaaac caggtgcggg cgtgacaaac 32520 
agatctgcgt ctccggtctc gccgcttaga tcgctctgtg tagtagttgt agtatatcca 32580 
ctctctcaaa gcatccaggc gccccctggc ttcgggttct atgtaaactc cttcatgcgc 32640 
cgctgccctg ataacatcca ccaccgcaga ataagccaca cccagccaac ctacacattc 32700 
gttctgcgag tcacacacgg gaggagcggg aagagctgga agaaccatgt tttttttttt 32760 
attccaaaag attatccaaa acctcaaaat gaagatctat taagtgaacg cgctcccctc 32820 
cggtggcgtg gtcaaactct acagccaaag aacagataat ggcatttgta agatgttgca 32880 
caatggcttc caaaaggcaa acggccctca cgtccaagtg gacgtaaagg ctaaaccctt 32940 
cagggtgaat ctcctctata aacattccag caccttcaac catgcccaaa taattctcat 33000 
ctcgccacct tctcaatata tctctaagca aatcccgaat attaagtccg gccattgtaa 33060 
aaatctgctc cagagcgccc tccaccttca gcctcaagca gcgaatcatg attgcaaaaa 33120 
ttcaggttcc tcacagacct gtataagatt caaaagcgga acattaacaa aaataccgcg 33180 
atcccgtagg tcccttcgca gggccagctg aacataatcg tgcaggtctg cacggaccag 33240 
cgcggccact tccccgccag gaaccttgac aaaagaaccc acactgatta tgacacgcat 33300 
actcggagct atgctaacca gcgtagcccc gatgtaagct ttgttgcatg ggcggcgata 33360 
taaaatgcaa ggtgctgctc aaaaaatcag gcaaagcctc gcgcaaaaaa gaaagcacat 33420 
cgtagtcatg ctcatgcaga taaaggcagg taagctccgg aaccaccaca gaaaaagaca 33480 
ccatttttct ctcaaacatg tctgcgggtt tctgcataaa cacaaaataa aataacaaaa 33540 
aaacatttaa acattagaag cctgtcttac aacaggaaaa acaaccctta taagcataag 33600 
acggactacg gccatgccgg cgtgaccgta aaaaaactgg tcaccgtgat taaaaagcac 33660 
caccgacagc tcctcggtca tgtccggagt cataatgtaa gactcggtaa acacatcagg 33720 
ttgattcatc ggtcagtgct aaaaagcgac cgaaatagcc cgggggaata catacccgca 33780 
ggcgtagaga caacattaca gcccccatag gaggtataac aaaattaata ggagagaaaa 33840 
acacataaac acctgaaaaa ccctcctgcc taggcaaaat agcaccctcc cgctccagaa 33900 
caacatacag cgcttcacag cggcagccta acagtcagcc ttaccagtaa aaaagaaaac 33960 
ctattaaaaa aacaccactc gacacggcac cagctcaatc agtcacagtg taaaaaaggg 34020 
ccaagtgcgt tacactgcag caggtgtgac tcagccatgg cacctctgca gcctgggtac 34080 
cctgcttggg gcatggcccc ttatagctgg gcggggcgtg ggggctctgt aggagtggca 34140 
gcgacctcag tgtttgtctt tgctctgaag agccctccag gtgcttgatc ccaccttttc 34200 
ccagcaggaa cactcctgcc tgccttacca cctgtcctgg ctgatggcct gttcctgcct 34260 
cctttgcccc ctgcccagac tcccatgttc ctggacttgt ggcttcctcc aaccaggggc 34320 
tctcaagcct ccatacctgg tcccacctct ccaggccgtg ggagggaggt tgaggagggt 34380 
ggagggcatc tggttggggg cagcctgggt gttcccctcc catcccctcc ctgggcctcc 34440 
caggccccct ctactcttga gcaatgctct tgagagcttc ctgcctggct cttaacccag 34500 
ggcaagccct ggaagggcag acccaggaca ctctcaccac ctccttacct tttcccctgg 34560 
aaaaatcttc tgtatacttc ccattttaag aaaactacaa ttcccaacac atacaagtta 34620 
ctccgcccta aaacctacgt cacccgcccc gttcccacgc cccgcgccac gtcacaaac^ 34680 
ccaccccctc attatcatat tggcttcaat ccaaaataag gtatattatt gatgatg 34737 

<210> 16 
<211> 36114 
<212> UNA 

<213> Adenovirus subgroup C 
<400> 16 

catcatcaat aatatacctt attttggatt gaagccaata tgataatgag ggggtggagt 60 
ttgtgacgtg gcgcggggcg tgggaacggg gcgggtgacg tagtagtgtg gcggaagtgt 120 
gatgttgcaa gtgtggcgga acacatgtaa gcgacggatg tggcaaaagt gacgtttttg 180 
gtgtgcgccg gtgtacacag gaagtgacaa ttttcgcgcg gttttaggcg gatgttgtag 240 
taaatttggg cgtaaccgag taagatttgg ccattttcgc gggaaaactg aataagagga 300 
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agtgaaatct gaataatttt gtgttactca 
gactttgacc gtttacgtgg agactcgccc 
cgggtcaaag ttggcgtttt attattatag 
tgagttcctc aagaggccac tcttgagtgc 
tccgacaccg ggactgaaaa tgagacatga 
ccattttgaa ccacctaccc ttcacgaact 
tcccaacgag gaggcggttt cgcagatttt 
agggattgac ttactcactt ttccgccggc 
ccggcagccc gagcagccgg agcagagagc 
tccacccagt gacgacgagg atgaagaggg 
ccccgggcac ggttgcaggt cttgtcatta 
tatgtgttcg ctttgctata tgaggacctg 
atgggcagtg ggtgatagag tggtgggttt 
gttttgtggt ttaaagaatt ttgtattgtg 
gagcctgagc ccgagccaga accggagcct 
cctgctatcc tgagacgccc gacatcacct 
agctgtgact ccggtccttc taacacacct 
cccattaaac cagttgccgt gagagttggt 
gacttgctta acgagcctgg • gcaacctttg 
ggtgtaaacc tgtgattgcg tgtgtggtta 
agtttaataa agggtgagat aatgtttaac 
aaagggtata taatgcgccg tgggctaatc 
gagtgtttgg aagatttttc tgctgtgcgt 
tcttggtttt ggaggtttct gtggggctca 
gaggattaca agtgggaatt tgaagagctt 
ttgaatctgg gtcaccaggc gcttttccaa 
acaccggggc gcgctgcggc tgctgttgct 
gaagaaaccc atctgagcgg ggggtacctg 
gcggttgtga gacacaagaa tcgcctgcta 
ccgacggagg agcagcagca gcagcaggag 
ccatggaacc cgagagccgg cctggaccct 
tgtatccaga actgagacgc attttgacaa 
taaagaggga gcggggggct tgtgaggcta 
taatgaccag acaccgtcct gagtgtatta 
atgagcttga tctgctggcg cagaagtatt 
agccagggga tgattttgag gaggctatta 
attgcaagta caagatcagc aaacttgtaa 
acggggccga ggtggagata gatacggagg 
atatgtggcc gggggtgctt ggcatggacg 
gccccaattt tagcggtacg gttttcctgg 
gcttctatgg gtttaacaat acctgtgtgg 
gtgcctttta ctgctgctgg aagggggtgg 
agaaatgcct ctttgaaagg tgtaccttgg 
gccacaatgt ggcctccgac tgtggttgct 
agcataacat ggtatgtggc aactgcgagg 
acggcaactg tcacctgctg aagaccattc 
cagtgtttga gcataacata . ctgacccgct 
tgttcctacc ttaccaatgc aatttgagtc 
tgtccaaggt gaacctgaac ggggtgtttg 
ggtacgatga gacccgcacc aggtgcagac 
accagcctgt gatgctggat gtgaccgagg 
gcacccgcgc tgagtttggc tctagcgatg 
ggcgtggctt aagggtggga aagaatatat 
gttttgcagc agccgccgcc gccatgagca 
catatttgac aacgcgcatg cccccatggg 
gcattgatgg tcgccccgtc ctgcccgcaa 
ctggaacgcc gttggagact gcagcctccg 
gcgggattgt gactgacttt gctttcctga 
catccgcccg cgatgacaag ttgacggctc 
aacttaatgt cgtttctcag cagctgttgg 
cttcctcccc tcccaatgcg gtttaaaaca 
ggatcaagca agtgtcttgc tgtctttatt 
accagcggtc tcggtcgttg agggtcctgt 
tctggatgtt cagatacatg ggcataagcc 
gagcttcatg ctgcggggtg gtgttgtaga 
ggtgcctaaa aatgtctttc agtagcaagc 



tagcgcgtaa tatttgtcta gggccgcggg 360 
aggtgttttt ctcaggtgtt ttccgcgttc 420 
tcagctgacg tgtagtgtat ttatacccgg 480 
cagcgagtag agttttctcc tccgagccgc 540 
ggtactggct gataatcttc cacctcctag 600 
gtatgattta gacgtgacgg cccccgaaga 660 
tcccgactct gtaatgttgg cggtgcagga 720 
gcccggttct ccggagccgc ctcacctttc 780 
cttgggtccg gtttgccacg aggctggctt 840 
tgaggagttt gtgttagatt atgtggagca 900 
tcaccggagg aatacggggg acccagatat 960 
tggcatgttt gtctacagta agtgaaaatt 1020 
ggtgtggtaa tttttttttt aatttttaca 1080 
atttttttaa aaggtcctgt gtctgaacct 1140 
gcaagaccta cccgccgtcc taaaatggcg 1200 
gtgtctagag aatgcaatag tagtacggat 1260 
cctgagatac acccggtggt cccgctgtgc 1320 
gggcgtcgcc aggctgtgga atgtatcgag 1380 
gacttgagct gtaaacgccc caggccataa 14 40 
acgcctttgt ttgctgaatg agttgatgta 1500 
ttgcatggcg tgttaaatgg ggcggggctt 1560 
ttggttacat ctgacctcat ggaggcttgg 1620 
aacttgctgg aacagagctc taacagtacc 1680 
tcccaggcaa agttagtctg cagaattaag 1740 
ttgaaatcct gtggtgagct gtttgattct 1800 
gagaaggtca tcaagacttt ggatttttcc 1860 
tttttgagtt ttataaagga taaatggagc 1920 
ctggattttc tggccatgca tctgtggaga 1980 
ctgttgtctt ccgtccgccc ggcgataata 2040 
gaagccaggc ggcggcggca ggagcagagc 2100 
cgggaatgaa tgttgtacag gtggctgaac 2160 
ttacagagga tgggcagggg ctaaaggggg 2220 
cagaggaggc taggaatcta gcttttagct 2280 
cttttcaaca gatcaaggat aattgcgcta 2340 
ccatagagca gctgaccact tactggctgc 2400 
gggtatatgc aaaggtggca cttaggccag 2460 
atatcaggaa ttgttgctac atttctggga 2520 
atagggtggc ctttagatgt agcatgataa 2580 
gggtggttat tatgaatgta aggtttactg 2640 
ccaataccaa ccttatccta cacggtgtaa 2700 
aagcctggac cgatgtaagg gttcggggct 2760 
tgtgtcgccc caaaagcagg gcttcaatta 2820 
gtatcctgtc tgagggtaac tccagggtgc 2880 
tcatgctagt gaaaagcgtg gctgtgatta 2940 
acagggcctc tcagatgctg acctgctcgg 3000 
acgtagccag ccactctcgc aaggcctggc 3060 
gttccttgca tttgggtaac aggagggggg 3120 
acactaagat attgcttgag cccgagagca 3180 
acatgaccat gaagatctgg aaggtgctga 3240 
cctgcgagtg tggcggtaaa catattagga 3300 
agctgaggcc cgatcacttg gtgctggcct 3360 
aagatacaga ttgaggtact gaaatgtgtg 3420 
aaggtggggg tcttatgtag ttttgtatct 3480 
ccaactcgtt tgatggaagc attgtgagct 3540 
ccggggtgcg tcagaatgtg atgggctcca 3600 
actctactac cttgacctac gagaccgtgt 3660 
ccgccgcttc agccgctgca gccaccgccc 3720 
gcccgcttgc aagcagtgca gcttcccgtt 3780 
ttttggcaca attggattct ttgacccggg 3840 
atctgcgcca gcaggtttct gccctgaagg 3900 
taaataaaaa accagactct gtttggattt 3960 
taggggtttt gcgcgcgcgg taggcccggg 4020 
gtattttttc caggacgtgg taaaggtgac 4080 
cgtctctggg gtggaggtag caccactgca 4140 
tgatccagtc gtagcaggag cgctgggcgt 4200 
tgattgccag gggcaggccc ttggtgtaag 4260 
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tgtttacaaa gcggttaagc tgggatgggt 
actgtatttt taggttggct atgttcccag 
gaaccaccag cacagtgtat ccggtgcact 
atgcgtggaa gaacttggag acgcccttgt 
taatgatggc aatgggccca cgggcggcgg 
cgtcatagtt gtgttccagg atgagatcgt 
gggtgccaga ctgcggtata atggttccat 
tttgcatttc ccacgctttg agttcagatg 
agaaaacggt ttccggggta ggggagatca 
gcgacttacc gcagccggtg ggcccgtaaa 
taagagagct gcagctgccg tcatccctga 
tgactcgcat gttttccctg accaaatccg 
gttcttgcaa ggaagcaaag tttttcaacg 
tgagcgtttg accaagcagt tccaggcggt 
ctcgatccag catatctcct cgtttcgcgg 
tcggtgctcg tccagacggg ccagggtcat 
cgtagtctgg gtcacggtga aggggtgcgc 
gaggctggtc ctgctggtgc tgaagcgctg 
gcatttgacc atggtgtcat agtccagccc 
gcccttggag gaggcgccgc acgaggggca 
cgcgagaaat accgattccg gggagtaggc 
gcattccacg agccaggtga gctctggccg 
ctttttgatg cgtttcttac ctctggtttc 
aaggctgtcc gtgtccccgt atacagactt 
gtcctcctcg tatagaaact cggaccactc 
gaaggaggct aagtgggagg ggtagcggtc 
ggtgtgaaga cacatgtcgc cctcttcggc 
ggccacgtga ccgggtgttc ctgaaggggg 
ctcactctct tccgcatcgc tgtctgcgag 
aaaagcgggc atgacttctg cgctaagatt 
attcacctgg cccgcggtga tgcctttgag 
aatctttttg ttgtcaagct tggtggcaaa 
ggcgatggag cgcagggttt ggtttttgtc 
tagctgcacg tattcgcgcg caacgcaccg 
gggcaccagg tgcacgcgcc aaccgcggtt 
tacctctccg cgtaggcgct cgttggtcca 
tggcggtagg gggtctagct gcgtctcgtc 
gggcagcagg cgcgcgtcga agtagtctat 
ccatgcgcgg gcggcaagcg cgcgctcgta 
gtgggtgagc gcggaggcgt acatgccgca 
tattccaaga tatgtagggt agcatcttcc 
tagttcgtgc gagggagcga ggaggtcggg 
tcggaagact atctgcctga agatggcatg 
gacgttgaag ctggcgtctg tgagacctac 
gcgcagcttg ttgaccagct cggcggtgac 
ttccttgatg atgtcatact tatcctgtcc 
aaactcttcg cggtctttcc agtactcttg 
agagcctagc atgtagaact ggttgacggc 
tagcgcgtat gcctgcgcgg ccttccggag 
gaccatgact ttgaggtact ggtatttgaa 
gagcaaaaag tccgtgcgct ttttggaacg 
gaagagtatc tttcccgcgc gaggcataaa 
ctcggaacgg ttgttaatta cctgggcggc 
gtggcccaca atgtaaagtt ccaagaagcg 
aagttcctcg taggtgagct cttcagggga 
tgcaagatga gggttggaag cgacgaatga 
caggtggtcg cgaaaggtcc taaactggcg 
gtagaaggta agcgggtctt gttcccagcg 
cgcggcagtc actagaggct catctccgcc 
ctgcttccca aaggccccca tccaagtata 
ctcggtgcga ggatgcgagc cgatcgggaa 
gtggctattg atgtggtgaa agtagaagtc 
tttgtaaaaa cgtgcgcagt actggcagcg 
gacctgacga ccgcgcacaa ggaagcagag 
tggctggtgg tcttctactt cggctgcttg 
tacggtggat cggaccacca cgccgcgcga 



gcatacgtgg ggatatgaga tgcatcttgg 4320 
ccatatccct ccggggattc atgttgtgca 4380 
tgggaaattt gtcatgtagc ttagaaggaa 4 4 40 
gacctccaag attttccatg cattcgtcca 4500 
cctgggcgaa gatatttctg ggatcactaa 4560 
cataggccat ttttacaaag cgcgggcgga 4 620 
ccggcccagg ggcgtagtta ccctcacaga 4680 
gggggatcat gtctacctgc ggggcgatga 474 0 
gctgggaaga aagcaggttc ctgagcagct 4800 
tcacacctat taccgggtgc aactggtagt 4860 
gcaggggggc cacttcgtta agcatgtccc 4 920 
ccagaaggcg ctcgccgccc agcgatagca 4980 
gtttgagacc gtccgccgta ggcatgcttt 5040 
cccacagctc ggtcacctgc tctacggcat 5100 
gttggggcgg ctttcgctgt acggcagtag 5160 
gtctttccac gggcgcaggg tcctcgtcag 5220 
tccgggctgc gcgctggcca gggtgcgctt 5280 
ccggtcttcg ccctgcgcgt cggccaggta 5340 
ctccgcggcg tggcccttgg cgcgcagctt 5400 
gtgcagactt ttgagggcgt agagcttggg 5460 
atccgcgccg caggccccgc agacggtctc 5520 
ttcggggtca aaaaccaggt ttcccccatg 5580 
catgagccgg tgtccacgct cggtgacgaa 5640 
gagaggcctg tcctcgagcg gtgttccgcg 5700 
tgagacaaag gctcgcgtcc aggccagcac 57 60 
gttgtccact agggggtcca ctcgctccag 5820 
atcaaggaag gtgattggtt tgtaggtgta 5880 
gctataaaag ggggtggggg cgcgttcgtc 5940 
ggccagctgt tggggtgagt actccctctg 6000 
gtcagtttcc aaaaacgagg aggatttgat 6060 
ggtggccgca tccatctggt cagaaaagac 6120 
cgacccgtag agggcgttgg acagcaactt 6180 
gcgatcggcg cgctccttgg ccgcgatgtt 6240 
ccattcggga aagacggtgg tgcgctcgtc 6300 
gtgcagggtg acaaggtcaa cgctggtggc 6360 
gcagaggcgg ccgcccttgc gcgagcagaa 6420 
cggggggtct gcgtccacgg taaagacccc 6480 
cttgcatcct tgcaagtcta gcgcctgctg 6540 
tgggttgagt gggggacccc atggcatggg 6600 
aatgtcgtaa acgtagaggg gctctctgag 6660 
accgcggatg ctggcgcgca cgtaatcgta 6720 
accgaggttg ctacgggcgg gctgctctgc 6780 
tgagttggat gatatggttg gacgctggaa 6840 
cgcgtcacgc acgaaggagg cgtaggagtc 6900 
ctgcacgtct agggcgcagt agtccagggt 6960 
cttttttttc cacagctcgc ggttgaggac 7020 
gatcggaaac ccgtcggcct ccgaacggta 7080 
ctggtaggcg cagcatccct tttctacggg 7140 
cgaggtgtgg gtgagcgcaa aggtgtccct 7200 
gtcagtgtcg tcgcatccgc cctgctccca 7260 
cggatttggc agggcgaagg tgacatcgtt 7320 
gttgcgtgtg atgcggaagg gtcccggcac 7380 
gagcacgatc tcgtcaaagc cgttgatgtt 7440 
cgggatgccc ttgatggaag gcaatttttt 7500 
gctgagcccg tgctctgaaa gggcccagtc 7560 
gctccacagg tcacgggcca ttagcatttg 7620 
acctatggcc attttttctg gggtgatgca 7 680 
gtcccatcca aggttcgcgg ctaggtctcg 7740 
gaacttcatg accagcatga agggcacgag 7800 
ggtctctaca tcgtaggtga caaagagacg 7860 
gaactggatc tcccgccacc aattggagga 7920 
cctgcgacgg gccgaacact cgtgctggct 7980 
gtgcacgggc tgtacatcct gcacgaggtt 8040 
tgggaatttg agcccctcgc ctggcgggtt 8100 
tccttgaccg tctggctgct cgaggggagt 8160 
gcccaaagtc cagatgtccg cgcgcggcgg 8220 



WO 01/04282 



68 



PCT/US00/18971 



tC ggagcttg atgacaacat cgcgcagatg ggagctgtcc atggtctgga gctcccgcgg 8280 
cgtcaggtca ggcgggagct cctgcaggtt tacctcgcat agacgggtca gggcgcgggc 8340 
tagatccagg tgatacctaa tttccagggg ctggttggtg gcggcgtcga tggcttgcaa 8400 
gaggccgcat ccccgcggcg cgactacggt accgcgcggc gggcggtggg ccgcgggggt 84 60 
gtccttggat gatgcatcta aaagcggtga cgcgggcgag cccccggagg tagggggggc 8520 
tccggacccg ccgggagagg gggcaggggc acgtcggcgc cgcgcgcggg caggagctgg 8580 
tgctgcgcgc gtaggttgct ggcgaacgcg acgacgcggc ggttgatctc ctgaatctgg 8640 
cgcctctgcg tgaagacgac gggcccggtg agcttgagcc tgaaagagag ttcgacagaa 8700 
tcaatttcgg tgtcgttgac ggcggcctgg cgcaaaatct cctgcacgtc tcctgagttg 8760 
tcttgatagg cgatctcggc catgaactgc tcgatctctt cctcctggag atctccgcgt 8820 
ccggctcgct ccacggtggc ggcgaggtcg ttggaaatgc gggccatgag ctgcgagaag 8880 
gcgttgaggc ctccctcgtt ccagacgcgg ctgtagacca cgcccccttc ggcatcgcgg 8 940 
gcgcgcatga ccacctgcgc gagattgagc tccacgtgcc gggcgaagac ggcgtagttt 9000 
cgcaggcgct gaaagaggta gttgagggtg gtggcggtgt gttctgccac gaagaagtac 9060 
ataacccagc gtcgcaacgt ggattcgttg atatccccca aggcctcaag gcgctccatg 9120 
gcctcgtaga agtccacggc gaagttgaaa aactgggagt tgcgcgccga cacggttaac 9180 
tcctcctcca gaagacggat gagctcggcg acagtgtcgc gcacctcgcg ctcaaaggct 9240 
acaggggcct cttcttcttc ttcaatctcc tcttccataa gggcctcccc ttcttcttct 9300 
tctggcggcg gtgggggagg ggggacacgg cggcgacgac ggcgcaccgg gaggcggtcg 9360 
acaaagcgct cgatcatctc cccgcggcga cggcgcatgg tctcggtgac ggcgcggccg 9420 
ttctcgcggg ggcgcagttg gaagacgccg cccgtcatgt cccggttatg ggttggcggg 9480 
gqgctgccat gcggcaggga tacggcgcta acgatgcatc tcaacaattg ttgtgtaggt 9540 
actccgccgc cgagggacct gagcgagtcc gcatcgaccg gatcggaaaa cctctcgaga 9600 
aaggcgtcta accagtcaca gtcgcaaggt aggctgagca ccgtggcggg cggcagcggg 9660 
cggcggtcgg ggttgtttct ggcggaggtg ctgctgatga tgtaattaaa gtaggcggtc 9720 
ttgagacggc ggatggtcga cagaagcacc atgtccttgg gtccggcctg ctgaatgcgc 9780 
aggcggtcgg ccatgcccca ggcttcgttt tgacatcggc gcaggtcttt gtagtagtct 9840 
tgcatgagcc tttctaccgg cacttcttct tctccttcct cttgtcctgc atctcttgca 9900 
tctatcgctg cggcggcggc ggagtttggc cgtaggtggc gccctcttcc tcccatgcgt 9960 
gtgaccccga agcccctcat cggctgaagc agggctaggt cggcgacaac gcgctcggct 10020 
aatatggcct gctgcacctg cgtgagggta gactggaagt catccatgtc cacaaagcgg 10080 
tggtatgcgc ccgtgttgat ggtgtaagtg cagttggcca taacggacca gttaacggtc 10140 
tggtgacccg gctgcgagag ctcggtgtac ctgagacgcg agtaagccct cgagtcaaat 10200 
acgtagtcgt tgcaagtccg caccaggtac tggtatccca ccaaaaagtg cggcggcggc 10260 
tggcggtaga ggggccagcg tagggtggcc ggggctccgg gggcgagatc ttccaacata 10320 
aggcgatgat atccgtagat gtacctggac atccaggtga tgccggcggc ggtggtggag 10380 
gcgcgcggaa agtcgcggac gcggttccag atgttgcgca gcggcaaaaa gtgctccatg 10440 
gtcgggacgc tctggccggt caggcgcgcg caatcgttga cgctctagcg tgcaaaagga 10500 
gagcctgtaa gcgggcactc ttccgtggtc tggtggataa attcgcaagg gtatcatggc 10560 
ggacgaccgg ggttcgagcc ccgtatccgg ccgtccgccg tgatccatgc ggttaccgcc 10620 
cgcgtgtcga acccaggtgt gcgacgtcag acaacggggg agtgctcctt ttggcttcct 10680 
tccaggcgcg gcggctgctg cgctagcttt tttggccact ggccgcgcgc agcgtaagcg 10740 
gttaggctgg aaagcgaaag cattaagtgg ctcgctccct gtagccggag ggttattttc 10800 
caagggttga gtcgcgggac ccccggttcg agtctcggac cggccggact gcggcgaacg 10860 
ggggtttgcc tccccgtcat gcaagacccc gcttgcaaat tcctccggaa acagggacga 10920 
gccccttttt tgcttttccc agatgcatcc ggtgctgcgg cagatgcgcc cccctcctca 10980 
gcagcggcaa gagcaagagc agcggcagac atgcagggca ccctcccctc ctcctaccgc 11040 
gtcaggaggg gcgacatccg cggttgacgc ggcagcagat ggtgattacg aacccccgcg 11100 
gcgccgggcc cggcactacc tggacttgga ggagggcgag ggcctggcgc ggctaggagc 11160 
gccctctcct gagcggtacc caagggtgca gctgaagcgt gatacgcgtg aggcgtacgt 11220 
gccgcggcag aacctgtttc gcgaccgcga gggagaggag cccgaggaga tgcgggatcg 11280 
aaagttccac gcagggcgcg agctgcggca tggcctgaat cgcgagcggt tgctgcgcga 11340 
ggaggacttt gagcccgacg cgcgaaccgg gattagtccc gcgcgcgcac acgtggcggc 11400 
cgccgacctg gtaaccgcat acgagcagac ggtgaaccag gagattaact ttcaaaaaag 11460 
ctttaacaac cacgtgcgta cgcttgtggc gcgcgaggag gtggctatag gactgatgca 11520 
tctgtgggac tttgtaagcg cgctggagca aaacccaaat agcaagccgc tcatggcgca 11580 
gctgttcctt atagtgcagc acagcaggga caacgaggca ttcagggatg cgctgctaaa 11640 
catagtagag cccgagggcc gctggctgct cgatttgata aacatcctgc agagcatagt 11700 
ggtgcaggag cgcagcttga gcctggctga caaggtggcc gccatcaact attccatgct 11760 
tagcctgggc aagttttacg cccgcaagat ataccatacc ccttacgttc ccatagacaa 11820 
ggaggtaaag atcgaggggt tctacatgcg catggcgctg aaggtgctta ccttgagcga 11880 
cgacctgggc gtttatcgca acgagcgcat ccacaaggcc gtgagcgtga gccggcggcg 11940 
cgagctcagc gaccgcgagc tgatgcacag cctgcaaagg gccctggctg gcacgggcag 12000 
cggcgataga gaggccgagt cctactttga cgcgggcgct gacctgcgct gggccccaag 12060 
ccgacgcgcc ctggaggcag ctggggccgg acctgggctg gcggtggcac ccgcgcgcgc 12120 
tggcaacgtc ggcggcgtgg aggaatatga cgaggacgat gagtacgagc cagaggacgg 12180 
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cgagtactaa gcggtgatgt ttctgatcag 
cgggcggcgc tgcagagcca gccgtccggc 
atggaccgca tcatgtcgct gactgcgcgc 
gccaaccggc tctccgcaat tctggaagcg 
gagaaggtgc tggcgatcgt aaacgcgctg 
gccggcctgg tctacgacgc gctgcttcag 
cagaccaacc tggaccggct ggtgggggat 
gcgcagcagc agggcaacct gggctccatg 
cccgccaacg tgccgcgggg acaggaggac 
atggtgactg agacaccgca aagtgaggtg 
accagtagac aaggcctgca gaccgtaaac 
ctgtgggggg tgcgggctcc cacaggcgac 
aactcgcgcc tgttgctgct gctaatagcg 
gacacatacc taggtcactt gctgacactg 
gacgagcata ctttccagga gattacaagt 
ggcagcctgg aggcaaccct aaactacctg 
ttgcacagtt taaacagcga ggaggagcgc 
cttaacctga tgcgcgacgg ggtaacgccc 
atggaaccgg gcatgtatgc ctcaaaccgg 
catcgcgcgg ccgccgtgaa ccccgagtat 
ctaccgcccc ctggtttcta caccggggga 
ctctgggacg acatagacga cagcgtgttt 
caacagcgcg agcaggcaga ggcggcgctg 
ttgtccgatc taggcgctgc ggccccgcgg 
atagggtctc ttaccagcac tcgcaccacc 
ctaaacaact cgctgctgca gccgcagcgc 
aacgggatag agagcctagt ggacaagatg 
agggacgtgc caggcccgcg cccgcccacc 
ctggtgtggg aggacgatga ctcggcagac 
ggcaacccgt ttgcgcacct tcgccccagg 
atgatgcaaa ataaaaaact caccaaggcc 
cccttagtat gcggcgcgcg gcgatgtatg 
tggtgagcgc ggcgccagtg gcggcggcgc 
cgccgtttgt gcctccgcgg tacctgcggc 
ctgagttggc acccctattc gacaccaccc 
atgtggcatc cctgaactac cagaacgacc 
acaatgacta cagcccgggg gaggcaagca 
actggggcgg cgacctgaaa accatcctgc 
tgtttaccaa taagtttaag gcgcgggtga 
aggtggagct gaaatacgag tgggtggagt 
ccatgaccat agaccttatg aacaacgcga 
agaacggggt tctggaaagc gacatcgggg 
ggtttgaccc cgtcactggt cttgtcatgc 
cagacatcat tttgctgcca ggatgcgggg 
tgttgggcat ccgcaagcgg caacccttcc 
tggagggtgg taacattccc gcactgttgg 
atgacaccga acagggcggg ggtggcgcag 
aagagaactc caacgcggca gccgcggcaa 
ccattcgcgg cgacaccttt gccacacggg 
cggccgaagc tgccgccccc gctgcgcaac 
tgatcaaacc cctgacagag gacagcaaga 
gcaccttcac ccagtaccgc agctggtacc 
gaatccgctc atggaccctg ctttgcactc 
actggtcgtt gccagacatg atgcaagacc 
gcaactttcc ggtggtgggc gccgagctgt 
accaggccgt ctactcccaa ctcatccgcc 
gctttcccga gaaccagatt ttggcgcgcc 
aaaacgttcc tgctctcaca gatcacggga 
tccagcgagt gaccattact gacgccagac 
tgggcatagt ctcgccgcgc gtcctatcga 
ttatatcgcc cagcaataac acaggctggg 
gggccaagaa gcgctccgac caacacccag 
ggggcgcgca caaacgcggc cgcactgggc 
tggtggagga ggcgcgcaac tacacgccca 
ccattcagac cgtggtgcgc ggagcccggc 
gcgtagcacg tcgccaccgc cgccgacccg 



atgatgcaag acgcaacgga cccggcggtg 12240 
cttaactcca cggacgactg gcgccaggtc 12300 
aatcctgacg cgttccggca gcagccgcag 12360 
gtggtcccgg cgcgcgcaaa ccccacgcac 12420 
gccgaaaaca gggccatccg gcccgacgag 12480 
cgcgtggctc gttacaacag cggcaacgtg 12540 
gtgcgcgagg ccgtggcgca gcgtgagcgc 12600 
gttgcactaa acgccttcct gagtacacag 12660 
tacaccaact ttgtgagcgc actgcggcta 12720 
taccagtctg ggccagacta ttttttccag 12780 
ctgagccagg ctttcaaaaa cttgcagggg 12840 
cgcgcgaccg tgtctagctt gctgacgccc 12900 
cccttcacgg acagtggcag cgtgtcccgg 12960 
taccgcgagg ccataggtca ggcgcatgtg 13020 
gtcagccgcg cgctggggca ggaggacacg 13080 
ctgaccaacc ggcggcagaa gatcccctcg 13140 
attttgcgct acgtgcagca gagcgtgagc 13200 
agcgtggcgc tggacatgac cgcgcgcaac 13260 
ccgtttatca accgcctaat ggactacttg 13320 
ttcaccaatg ccatcttgaa cccgcactgg 13380 
ttcgaggtgc ccgagggtaa cgatggattc 13440 
tccccgcaac cgcagaccct gctagagttg 13500 
cgaaaggaaa gcttccgcag gccaagcagc 13560 
tcagatgcta gtagcccatt tccaagcttg 13620 
cgcccgcgcc tgctgggcga ggaggagtac 13680 
gaaaaaaacc tgcctccggc atttcccaac 13740 
agtagatgga agacgtacgc gcaggagcac 13800 
cgtcgtcaaa ggcacgaccg tcagcggggt 13860 
gacagcagcg tcctggattt gggagggagt 13920 
ctggggagaa tgttttaaaa aaaaaaaagc 13980 
atggcaccga gcgttggttt tcttgtattc 14040 
aggaaggtcc tcctccctcc tacgagagtg 14100 
tgggttctcc cttcgatgct cccctggacc 14160 
ctaccggggg gagaaacagc atccgttact 14220 
gtgtgtacct ggtggacaac aagtcaacgg 14280 
acagcaactt tctgaccacg gtcattcaaa 14340 
cacagaccat caatcttgac gaccggtcgc 14400 
ataccaacat gccaaatgtg aacgagttca 144 60 
tggtgtcgcg cttgcctact aaggacaatc 14520 
tcacgctgcc cgagggcaac tactccgaga 14580 
tcgtggagca ctacttgaaa gtgggcagac 14 640 
taaagtttga cacccgcaac ttcagactgg 14700 
ctggggtata tacaaacgaa gccttccatc 14760 
tggacttcac ccacagccgc ctgagcaact 14820 
aggagggctt taggatcacc tacgatgatc 14880 
atgtggacgc ctaccaggcg agcttgaaag 14940 
gcggcagcaa cagcagtggc agcggcgcgg 15000 
tgcagccggt ggaggacatg aacgatcatg 15060 
ctgaggagaa gcgcgctgag gccgaagcag 15120 
ccgaggtcga gaagcctcag aagaaaccgg 15180 
aacgcagtta caacctaata agcaatgaca 15240 
ttgcatacaa ctacggcgac cctcagaccg 15300 
ctgacgtaac ctgcggctcg gagcaggtct 15360 
ccgtgacctt ccgctccacg cgccagatca 15420 
tgcccgtgca ctccaagagc ttctacaacg 15480 
agtttacctc tctgacccac gtgttcaatc 15540 
cgccagcccc caccatcacc accgtcagtg 15600 
cgctaccgct gcgcaacagc atcggaggag 15660 
gccgcacctg cccctacgtt tacaaggccc 15720 
gccgcacttt ttgagcaagc atgtccatcc 15780 
gcctgcgctt cccaagcaag atgtttggcg 15840 
tgcgcgtgcg cgggcactac cgcgcgccct 15900 
gcaccaccgt cgatgacgcc atcgacgcgg 15960 
cgccgccacc agtgtccaca gtggacgcgg 16020 
gctatgctaa aatgaagaga cggcggaggc 16080 
gcactgccgc ccaacgcgcg gcggcggccc 16140 
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tgcttaaccg cgcacgtcgc accggccgac gggcggccat gcgggccgct cgaaggctgg 16200 
ccgcgggtat tgtcactgtg ccccccaggt ccaggcgacg agcggccgcc gcagcagccg 16260 
cggccattag tgctatgact cagggtcgca ggggcaacgt gtattgggtg cgcgactcgg 16320 
ttagcggcct gcgcgtgccc gtgcgcaccc gccccccgcg caactagatt gcaagaaaaa 16380 
actacttaga ctcgtactgt tgtatgtatc cagcggcggc ggcgcgcaac gaagctatgt 164 40 
ccaagcgcaa aatcaaagaa gagatgctcc aggtcatcgc gccggagatc tatggccccc 16500 
cgaagaagga agagcaggat tacaagcccc gaaagctaaa gcgggtcaaa aagaaaaaga 16560 
aagatgatga tgatgaactt gacgacgagg tggaactgct gcacgctacc gcgcccaggc 16620 
gacgggtaca gtggaaaggt cgacgcgtaa aacgtgtttt gcgacccggc accaccgtag 16680 
tctttacgcc cggtgagcgc tccacccgca cctacaagcg cgtgtatgat gaggtgtacg 16740 
gcgacgagga cctgcttgag caggccaacg agcgcctcgg ggagtttgcc tacggaaagc 16800 
ggcataagga catgctggcg ttgccgctgg acgagggcaa cccaacacct agcctaaagc 16860 
ccgtaacact gcagcaggtg ctgcccgcgc ttgcaccgtc cgaagaaaag cgcggcctaa 16920 
agcgcgagtc tggtgacttg gcacccaccg tgcagctgat ggtacccaag cgccagcgac 16980 
tggaagatgt cttggaaaaa atgaccgtgg aacctgggct ggagcccgag gtccgcgtgc 17040 
ggccaatcaa gcaggtggcg ccgggactgg gcgtgcagac cgtggacgtt cagataccca 17100 
ctaccagtag caccagtatt gccaccgcca cagagggcat ggagacacaa acgtccccgg 17160 
ttgcctcagc ggtggcggat gccgcggtgc aggcggtcgc tgcggccgcg tccaagacct 17220 
ctacggaggt gcaaacggac ccgtggatgt ttcgcgtttc agccccccgg cgcccgcgcg 17280 
gttcgaggaa gtacggcgcc gccagcgcgc tactgcccga atatgcccta catccttcca 17340 
ttgcgcctac ccccggctat cgtggctaca cctaccgccc cagaagacga gcaactaccc 17400 
gacgccgaac caccactgga acccgccgcc gccgtcgccg tcgccagccc gtgctggccc 17460 
cgatttccgt gcgcagggtg gctcgcgaag gaggcaggac cctggtgctg ccaacagcgc 17520 
gctaccaccc cagcatcgtt taaaagccgg tctttgtggt tcttgcagat atggccctca 17580 
cctgccgcct ccgtttcccg gtgccgggat tccgaggaag aatgcaccgt aggaggggca 17640 
tggccggcca cggcctgacg ggcggcatgc gtcgtgcgca ccaccggcgg cggcgcgcgt 17700 
cgcaccgtcg catgcgcggc ggtatcctgc ccctccttat tccactgatc gccgcggcga 17760 
ttggcgccgt gcccggaatt gcatccgtgg ccttgcaggc gcagagacac tgattaaaaa 17820 
caagttgcat gtggaaaaat caaaataaaa agtctggact ctcacgctcg cttggtcctg 17880 
taactatttt gtagaatgga agacatcaac tttgcgtctc tggccccgcg acacggctcg 17940 
cgcccgttca tgggaaactg gcaagatatc ggcaccagca atatgagcgg tggcgccttc 18000 
agctggggct cgctgtggag cggcattaaa aatttcggtt ccaccgttaa gaactatggc 18060 
agcaaggcct ggaacagcag cacaggccag atgctgaggg ataagttgaa agagcaaaat 18120 
ttccaacaaa aggtggtaga tggcctggcc tctggcatta gcggggtggt ggacctggcc 18180 
aaccaggcag tgcaaaataa gattaacagt aagcttgatc cccgccctcc cgtagaggag 18240 
cctccaccgg ccgtggagac agtgtctcca gaggggcgtg gcgaaaagcg tccgcgcccc 18300 
gacagggaag aaactctggt gacgcaaata gacgagcctc cctcgtacga ggaggcacta 18360 
aagcaaggcc tgcccaccac ccgtcccatc gcgcccatgg ctaccggagt gctgggccag 18420 
cacacacccg taacgctgga cctgcctccc cccgccgaca cccagcagaa acctgtgctg 18480 
ccaggcccga ccgccgttgt tgtaacccgt cctagccgcg cgtccctgcg ccgcgccgcc 18540 
agcggtccgc gatcgttgcg gcccgtagcc agtggcaact ggcaaagcac actgaacagc 18600 
atcgtgggtc tgggggtgca atccctgaag cgccgacgat gcttctgaat agctaacgtg 18660 
tcgtatgtgt gtcatgtatg cgtccatgtc gccgccagag gagctgctga gccgccgcgc 18720 
gcccgctttc caagatggct accccttcga tgatgccgca gtggtcttac atgcacatct 18780 
cgggccagga cgcctcggag tacctgagcc ccgggctggt gcagtttgcc cgcgccaccg 18840 
agacgtactt cagcctgaat aacaagttta. gaaaccccac ggtggcgcct acgcacgacg 18900 
tgaccacaga ccggtcccag cgtttgacgc tgcggttcat ccctgtggac cgtgaggata 18960 
ctgcgtactc gtacaaggcg cggttcaccc tagctgtggg tgataaccgt gtgctggaca 19020 
tggcttccac gtactttgac atccgcggcg tgctggacag gggccctact tttaagccct 19080 
actctggcac tgcctacaac gccctggctc ccaagggtgc cccaaatcct tgcgaatggg 19140 
atgaagctgc tactgctctt gaaataaacc tagaagaaga ggacgatgac aacgaagacg 19200 
aagtagacga gcaagctgag cagcaaaaaa ctcacgtatt tgggcaggcg ccttattctg 19260 
gtataaatat tacaaaggag ggtattcaaa taggtgtcga aggtcaaaca cctaaatatg 19320 
ccgataaaac atttcaacct gaacctcaaa taggagaatc tcagtggtac gaaactgaaa 19380 
ttaatcatgc agctgggaga gtccttaaaa agactacccc aatgaaacca tgttacggtt 19440 
catatgcaaa acccacaaat gaaaatggag ggcaaggcat tcttgtaaag caacaaaatg 19500 
gaaagctaga aagtcaagtg gaaatgcaat ttttctcaac tactgaggcg accgcaggca 19560 
atggtgataa cttgactcct aaagtggtat tgtacagtga agatgtagat atagaaaccc 19620 
cagacactca tatttcttac atgcccacta ttaaggaagg taactcacga gaactaatgg 19680 
gccaacaatc tatgcccaac aggcctaatt acattgcttt tagggacaat tttattggtc 19740 
taatgtatta caacagcacg ggtaatatgg gtgttctggc gggccaagca tcgcagttga 19800 
atgctgttgt agatttgcaa gacagaaaca cagagctttc ataccagctt ttgcttgatt 19860 
ccattggtga tagaaccagg tacttttcta tgtggaatca ggctgttgac agctatgatc 19920 
cagatgttag aattattgaa aatcatggaa ctgaagatga acttccaaat tactgctttc 19980 
cactgggagg tgtgattaat acagagactc ttaccaaggt aaaacctaaa acaggtcagg 20040 
aaaatggatg ggaaaaagat gctacagaat tttcagataa aaatgaaata agagttggaa 20100 
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ataattttgc catggaaatc aatctaaatg 
acatagcgct gtatttgccc gacaagctaa 
ataacccaaa cacctacgac tacatgaaca 
gctacattaa ccttggagca cgctggtccc 
accaccaccg caatgctggc ctgcgctacc 
tgcccttcca catccaggtg cctcagaagt 
cgggctcata cacctacgag tggaacttca 
ccctaggaaa tgacctaagg gttgacggag 
acgccacctt cttccccatg gcccacaaca 
acgacaccaa cgaccagtcc tttaacgact 
tacccgccaa cgctaccaac gtgcccatat 
gcggctgggc cttcacgcgc cttaagacta 
acccttatta cacctactct ggctctatac 
acacctttaa gaaggtggcc attacctttg 
gcctgcttac ccccaacgag tttgaaatta 
ttgcccagtg taacatgacc aaagactggt 
ttggctacca gggcttctat atcccagaga 
gaaacttcca gcccatgagc cgtcaggtgg 
aggtgggcat cctacaccaa cacaacaact 
tgcgcgaagg acaggcctac cctgctaact 
ttgacagcat tacccagaaa aagtttcttt 
ccagtaactt tatgtccatg ggcgcactca 
actccgccca cgcgctagac atgacttttg 
tttatgtttt gtttgaagtc tttgacgtgg 
tcgaaaccgt gtacctgcgc acgcccttct 
agcaacatca acaacagctg ccgccatggg 
caaagatctt ggttgtgggc catatttttt 
tgtttctcca cacaagctcg cctgcgccat 
cgtacactgg atggcctttg cctggaaccc 
ctttggcttt tctgaccagc gactcaagca 
gcgccgtagc gccattgctt cttcccccga 
aagcgtacag gggcccaact cggccgcctg 
ctttgccaac tggccccaaa ctcccatgga 
ggtacccaac tccatgctca acagtcccca 
acagctctac agcttcctgg agcgccactc 
taggagcgcc acttcttttt gtcacttgaa 
tttcaataaa ggcaaatgct tttatttgta 
tgccgtctgc gccgtttaaa aatcaaaggg 
cagggacacg ttgcgatact ggtgtttagt 
cggcagctcg gtgaagtttt cactccacag 
gtcgggcgcc gatatcttga agtcgcagtt 
atacacaggg ttgcagcact ggaacactat 
gctcttgtcg gagatcagat ccgcgtccag 
caactttggt agctgccttc ccaaaaaggg 
ccgtagtggc atcaaaaggt gaccgtgccc 
aaaagccttg atctgcttaa aagccacctg 
gcaagacttg ccggaaaact gattggccgg 
gtcggtgttg gagatctgca ccacatttcg 
gctagactgc tccttcagcg cgcgctgccc 
gtgctcctta tttatcataa tgcttccgtg 
gcagcggtgc agccacaacg cgcagcccgt 
aaacgactgc aggtacgcct gcaggaatcg 
ggtgaaggtc agctgcaacc cgcggtgctc 
cagagcttcc acttggtcag gcagtagttt 
gtacttgtcc atcagcgcgc gcgcagcctc 
cacactcagc gggttcatca ccgtaatttc 
ctcttgcgtc cgcataccac gcgccactgg 
cttacctcct ttgccatgct tgattagcac 
cgccacatct tctctttctt cctcgctgtc 
. gggcttggga gaagggcgct tctttttctt 
ggtcgatggc cgcgggctgg gtgtgcgcgg 
gtcctcggac tcgatacgcc gcctcatccg 
cgacggggac ggggacgaca cgtcctccat 
gcgctcgggg gtggtttcgc gctgctcctc 
gcagaaaaag atcatggagt cagtcgagaa 
cgccaccacc gcctccaccg atgccgccaa 
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ccaacctgtg gagaaatttc ctgtactcca 20160 
agtacagtcc ttccaacgta aaaatttctg 20220 
agcgagtggt ggctcccggg ttagtggact 20280 
ttgactatat ggacaacgtc aacccattta 20340 
gctcaatgtt gctgggcaat ggtcgctatg 20400 
tctttgccat taaaaacctc cttctcctgc 20460 
ggaaggatgt taacatggtt ctgcagagct 20520 
ccagcattaa gtttgatagc atttgccttt 20580 
ccgcctccac gcttgaggcc atgcttagaa 20640 
atctctccgc cgccaacatg ctctacccta 20700 
ccatcccctc ccgcaactgg gcggctttcc 20760 
aggaaacccc atcactgggc tcgggctacg 20820 
cctacctaga tggaaccttt tacctcaacc 20880 
actcttctgt cagctggcct ggcaatgacc 20940 
agcgctcagt tgacggggag ggttacaacg 21000 
tcctggtaca aatgctagct aactacaaca 21060 
gctacaagga ccgcatgtac tccttcttta 21120 
tggatgatac taaatacaag gactaccaac 21180 
ctggatttgt tggctacctt gcccccacca 21240 
tcccctatcc gcttataggc aagaccgcag 21300 
gcgatcgcac cctttggcgc atcccattct 21360 
cagacctggg ccaaaacctt ctctacgcca 21420 
aggtggatcc catggacgag cccacccttc 21480 
tccgtgtgca ccggccgcac cgcggcgtca 21540 
cggccggcaa cgccacaaca taaagaagca 21600 
ctccagtgag caggaactga aagccattgt 21660 
gggcacctat gacaagcgct ttccaggctt 21720 
agtcaatacg gccggtcgcg agactggggg 21780 
gcactcaaaa acatgctacc tctttgagcc 21840 
ggtttaccag tttgagtacg agtcactcct 21900 
ccgctgtata acgctggaaa agtccaccca 21960 
tggactattc tgctgcatgt ttctccacgc 22020 
tcacaacccc accatgaacc ttattaccgg 22080 
ggtacagccc accctgcgtc gcaaccagga 22140 
gccctacttc cgcagccaca gtgcgcagat 22200 
aaacatgtaa aaataatgta ctagagacac 22260 
cactctcggg tgattattta cccccaccct 22320 
gttctgccgc gcatcgctat gcgccactgg 22380 
gctccactta aactcaggca caaccatccg 22440 
gctgcgcacc atcaccaacg cgtttagcag 22500 
ggggcctccg ccctgcgcgc gcgagttgcg 22560 
cagcgccggg tggtgcacgc tggccagcac 22620 
gtcctccgcg ttgctcaggg cgaacggagt 22680 
cgcgtgccca ggctttgagt tgcactcgca 22740 
ggtctgggcg ttaggataca gcgcctgcat 22800 
agcctttgcg ccttcagaga agaacatgcc 22860 
acaggccgcg tcgtgcacgc agcaccttgc 22920 
gccccaccgg ttcttcacga tcttggcctt 22980 
gttttcgctc gtcacatcca tttcaatcac 23040 
tagacactta agctcgcctt cgatctcagc 23100 
gggctcgtga tgcttgtagg tcacctctgc 23160 
ccccatcatc gtcacaaagg tcttgttgct 23220 
ctcgttcagc caggtcttgc atacggccgc 23280 
gaagttcgcc tttagatcgt tatccacgtg 23340 
catgcccttc tcccacgcag acacgatcgg 23400 
actttccgct tcgctgggct cttcctcttc 23460 
gtcgtcttca ttcagccgcc gcactgtgcg 23520 
cggtgggttg ctgaaaccca ccatttgtag 23580 
cacgattacc tctggtgatg gcgggcgctc 23640 
cttgggcgca atggccaaat ccgccgccga 23700 
caccagcgcg tcttgtgatg agtcttcctc 23760 
cttttttggg ggcgcccggg gaggcggcgg 23820 
ggttggggga cgtcgcgccg caccgcgtcc 23880 
ttcccgactg gccatttcct tctcctatag 23940 
gaaggacagc ctaaccgccc cctctgagtt 24000 
cgcgcctacc accttccccg tcgaggcacc 24060 
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cccgcttgag gaggaggaag tgattatcga 
cgaggaccgc tcagtaccaa cagaggataa 
cgaggaacaa gtcgggcggg gggacgaaag 
cgtgctgttg aagcatctgc agcgccagtg 
cagcgatgtg cccctcgcca tagcggatgt 
accgcgcgta ccccccaaac gccaagaaaa 
cttctacccc gtatttgccg tgccagaggt 
ctgcaagata cccctatcct gccgtgccaa 
gcggcagggc gctgtcatac ctgatatcgc 
gggtcttgga cgcgacgaga agcgcgcggc 
tgaaagtcac tctggagtgt tggtggaact 
aaaacgcagc atcgaggtca cccactttgc 
catgagcaca gtcatgagtg agctgatcgt 
aaatttgcaa gaacaaacag aggagggcct 
ctggcttcaa acgcgcgagc ctgccgactt 
agtgctcgtt accgtggagc ttgagtgcat 
gcgcaagcta gaggaaacat tgcactacac 
caagatctcc aacgtggagc tctgcaacct 
ccgccttggg caaaacgtgc ttcattccac 
ccgcgactgc gtttacttat ttctatgcta 
gcagtgcttg gaggagtgca acctcaagga 
ggacctatgg acggccttca acgagcgctc 
ccccgaacgc ctgcttaaaa ccctgcaaca 
gttgcagaac tttaggaact ttatcctaga 
tgcacttcct agcgactttg tgcccattaa 
ccactgctac cttctgcagc tagccaacta 
cgtgagcggt gacggtctac tggagtgtca 
ctccctggtt tgcaattcgc agctgcttaa 
gcagggtccc tcgcctgacg aaaagtccgc 
gtggacgtcg gcttaccttc gcaaatttgt 
gttctacgaa gaccaatccc gcccgccaaa 
gggccacatt cttggccaat tgcaagccat 
aaagggacgg ggggtttact tggaccccca 
gccgccgcag ccctatcagc agcagccgcg 
agaagctgca gctgccgccg ccacccacgg 
aggaggtttt ggacgaggag gaggaggaca 
aagcttccga ggtcgaagag gtgtcagacg 
cgccggcgcc ccagaaatcg gcaaccggtt 
cgccgccggc actgcccgtt cgccgaccca 
ccggtaagtc caagcagccg ccgccgttag 
gctcatggcg cgggcacaag aacgccatag 
tctccttcgc ccgccgcttt cttctctacc 
tgcattacta ccgtcatctc tacagcccat 
acagcagcgg ccacacagaa gcaaaggcga 
aaatccacag cggcggcagc agcaggagga 
gtatcgaccc gcgagcttag aaacaggatt 
agcaggggcc aagaacaaga gctgaaaata 
agctgcctgt atcacaaaag cgaagatcag 
ctcttcagta aatactgcgc gctgactctt 
taagcgcgaa aactacgtca tctccagcgg 
gccattatga gcaaggaaat tcccacgccc 
cttgcggctg gagctgccca agactactca 
cacatgatat cccgggtcaa cggaatccgc 
gcggctatta ccaccacacc tcgtaataac 
gtgtaccagg aaagtcccgc tcccaccact 
gttcagatga ctaactcagg ggcgcagctt 
cccgggcagg gtataactca cctgacaatc 
tcggtgagct cctcgcttgg tctccgtccg 
cgtccttcat tcacgcctcg tcaggcaatc 
cgctctggag gcattggaac tctgcaattt 
aaccccttct cgggacctcc cggccactat 
gtaaaggact cggcggacgg ctacgactga 
ctgaaacacc tggtccactg tcgccgccac 
tgctactttg aattgcccga ggatcatatc 
gcccagggag agcttgcccg tagcctgatt 
gagcgggaca ggggaccctg tgttctcact 



gcaggaccca ggttttgtaa gcgaagacga 24120 
aaagcaagac caggacaacg cagaggcaaa 24180 
gcatggcgac tacctagatg tgggagacga 24240 
cgccattatc tgcgacgcgt tgcaagagcg 24300 
cagccttgcc tacgaacgcc acctattctc 24360 
cggcacatgc gagcccaacc cgcgcctcaa 24420 
gcttgccacc tatcacatct ttttccaaaa 24480 
ccgcagccga gcggacaagc agctggcctt 24540 
ctcgctcaac gaagtgccaa aaatctttga 24 600 
aaacgctctg caacaggaaa acagcgaaaa 24 660 
cgagggtgac aacgcgcgcc tagccgtact 24720 
ctacccggca cttaacctac cccccaaggt 24780 
gcgccgtgcg cagcccctgg agagggatgc 24840 
acccgcagtt ggcgacgagc agctagcgcg 24 900 
ggaggagcga cgcaaactaa tgatggccgc 24 960 
gcagcggttc tttgctgacc cggagatgca 25020 
ctttcgacag ggctacgtac gccaggcctg 25080 
ggtctcctac cttggaattt tgcacgaaaa 25140 
gctcaagggc gaggcgcgcc gcgactacgt 25200 
cacctggcag acggccatgg gcgtttggca 25260 
gctgcagaaa ctgctaaagc aaaacttgaa 25320 
cgtggccgcg cacctggcgg acatcatttt 25380 
gggtctgcca gacttcacca gtcaaagcat. 25440 
gcgctcagga atcttgcccg ccacctgctg 25500 
gtaccgcgaa tgccctccgc cgctttgggg 25560 
ccttgcctac cactctgaca taatggaaga 25620 
ctgtcgctgc aacctatgca ccccgcaccg 25680 
cgaaagtcaa attatcggta cctttgagct 25740 
ggctccgggg ttgaaactca ctccggggct 25800 
acctgaggac taccacgccc acgagattag 25860 
tgcggagctt accgcctgcg tcattaccca 25920 
caacaaagcc cgccaagagt ttctgctacg 25980 
gtccggcgag gagctcaacc caatcccccc 26040 
ggcccttgct tcccaggatg gcacccaaaa 26100 
acgaggagga atactgggac agtcaggcag 26160 
tgatggaaga ctgggagagc ctagacgagg 26220 
aaacaccgtc accctcggtc gcattcccct 26280 
ccagcatggc tacaacctcc gctcctcagg 26340 
accgtagatg ggacaccact ggaaccaggg 26400 
cccaagagca acaacagcgc caaggctacc 26460 
ttgcttgctt gcaagactgt gggggcaaca 26520 
atcacggcgt ggccttcccc cgtaacatcc 26580 
actgcaccgg cggcagcggc agcggcagca 26640 
ccggatagca agactctgac aaagcccaag 26700 
ggagcgctgc gtctggcgcc caacgaaccc 26760 
tttcccactc tgtatgctat atttcaacag 26820 
aaaaacaggt ctctgcgatc cctcacccgc 26880 
cttcggcgca cgctggaaga cgcggaggct 26940 
aaggactagt ttcgcgccct ttctcaaatt 27000 
ccacacccgg cgccagcacc tgtcgtcagc 27060 
tacatgtgga gttaccagcc acaaatggga 27120 
acccgaataa actacatgag cgcgggaccc 27180 
gcccaccgaa accgaattct cttggaacag 27240 
cttaatcccc gtagttggcc cgctgccctg 27300 
gtggtacttc ccagagacgc ccaggccgaa 27360 
gcgggcggct ttcgtcacag ggtgcggtcg 27420 
agagggcgag gtattcagct caacgacgag 27480 
gacgggacat ttcagatcgg cggcgccggc 27540 
ctaactctgc agacctcgtc ctctgagccg 27600 
attgaggagt ttgtgccatc ggtctacttt 27660 
ccggatcaat ttattcctaa ctttgacgcg 27720 
atgttaagtg gagaggcaga gcaactgcgc 27780 
aagtgctttg cccgcgactc cggtgagttt 27840 
gagggcccgg cgcacggcgt ccggcttacc 27900 
cgggagttta cccagcgccc cctgctagtt 27960 
gtgatttgca actgtcctaa ccttggatta 28020 
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catcaagatc tttgttgcca tctctgtgct gagtataata aatacagaaa ttaaaatata 28080 
ctggggctcc tatcgccatc ctgtaaacgc caccgtcttc acccgcccaa gcaaaccaag 28140 
gcgaacctta cctggtactt ttaacatctc tccctctgtg atttacaaca gtttcaaccc 28200 
agacggagtg agtctacgag agaacctctc cgagctcagc tactccatca gaaaaaacac 28260 
caccctcctt acctgccggg aacgtacgag tgcgtcaccg gccgctgcac cacacctacc 28320 
gcctgaccgt aaaccagact ttttccggac agacctcaat aactctgttt accagaacag 28380 
gaggtgagct tagaaaaccc ttagggtatt aggccaaagg cgcagctact gtggggttta 28440 
tgaacaattc aagcaactct acgggctatt ctaattcagg tttctctagg gttggggtta 28500 
ttctctgtct tgtgattctc tttattctta tactaacgct tctctgccta aggctcgccg 28560 
cctgctgtgt gcacatttgc atttattgtc agctttttaa acgctggggt cgccacccaa 28620 
gatgattagg tacataatcc taggtttact cacccttgcg tcagcccacg gtaccaccca 28680 
aaaggtggat tttaaggagc cagcctgtaa tgttacattc gcagctgaag ctaatgagtg 28740 
caccactctt ataaaatgca ccacagaaca tgaaaagctg cttattcgcc acaaaaacaa 28800 
aattggcaag tatgctgttt atgctatttg gcagccaggt gacactacag agtataatgt 28860 
tacagttttc cagggtaaaa gtcataaaac ttttatgtat acttttccat tttatgaaat 28920 
gtgcgacatt accatgtaca tgagcaaaca gtataagttg tggcccccac aaaattgtgt 28980 
ggaaaacact ggcactttct gctgcactgc tatgctaatt acagtgctcg ctttggtctg 29040 
taccctactc tatattaaat acaaaagcag acgcagcttt attgaggaaa agaaaatgcc 29100 
ttaatttact aagttacaaa gctaatgtca ccactaactg ctttactcgc tgcttgcaaa 29160 
acaaattcaa aaagttagca ttataattag aataggattt aaaccccccg gtcatttcct 29220 
gctcaatacc attcccctga acaattgact ctatgtggga tatgctccag cgctacaacc 29280 
ttgaagtcag gcttcctgga tgtcagcatc tgactttggc cagcacctgt cccgcggatt 29340 
tgttccagtc caactacagc gacccaccct aacagagatg accaacacaa ccaacgcggc 29400 
cgccgctacc ggacttacat ctaccacaaa tacaccccaa gtttctgcct ttgtcaataa 294 60 
ctgggataac ttgggcatgt ggtggttctc catagcgctt atgtttgtat gccttattat 29520 
tatgtggctc atctgctgcc taaagcgcaa acgcgcccga ccacccatct atagtcccat 29580 
cattgtgcta cacccaaaca atgatggaat ccatagattg gacggactga aacacatgtt 29640 
cttttctctt acagtatgat taaatgagac atgattcctc gagtttttat attactgacc 29700 
cttgttgcgc ttttttgtgc gtgctccaca ttggctgcgg tttctcacat cgaagtagac 29760 
tgcattccag ccttcacagt ctatttgctt tacggatttg tcaccctcac gctcatctgc 29820 
agcctcatca ctgtggtcat cgcctttatc cagtgcattg actgggtctg tgtgcgcttt 29880 
gcatatctca gctgctgcca tgttgtgttg ctaccatgtt gttttcatgt gttgctgcca 29940 
tgctcttgtc gccttagatc tctctttatg tagtgttgtg gtgtctctct tgtcgtgatg 30000 
tgtgttttgt cctatatatt ttaattttta atccaaaccc ctgtccccgc agaggccttt 30060 
gcgttctggt aggccgtcat tgaaaactga cttaactcgt taaattaaaa aaatgtaaaa 30120 
aataatggtt gagactcagc ccaacatcgg cagatgaggt ggattgagac tcagcccaac 301B0 
atcggcagat gaggtggatt gagactcaac cccaacattg gcagatgagg tgaattagat 30240 
gaggtggatt gagactcatg agggtggtat gagggcccga cgtccacagg tgggagttgt 30300 
gctttacagt ccaacgtgca ggacgcttgg catttgccag agaacaccaa gattggcaaa 30360 
ttcgcaactg gcgccctgtg ctcttcacag acggaaaaat gaccaaaatc tgattatttt 30420 
tgtaaaacgg aaaccgaatg tccgacaaag ttcatttgat gacttcccgg taggtctgcc 30480 
ctgccgctgg gccgacgccg tccgggaatt ttacaaacga tttcggacgt ctagcattca 30540 
ctcaccttgt caaggacctg aggatctctg cacccttatt aagaccctgt gcggtctcaa 30600 
agatcttatt ccctttaact aataaaaaaa aataataaag catcacttac ttaaaatcag 30660 
ttagcaaatt tctgtccagt ttattcagca gcacctcctt gccctcctcc cagctctggt 30720 
attgcagctt cctcctggct gcaaactttc tccacaatct aaatggaatg tcagtttcct 30780 
cctgttcctg tccatccgca cccactatct tcatgttgtt gcagatgaag cgcgcaagac 30840 
cgtctgaaga taccttcaac cccgtgtatc catatgacac ggaaaccggt cctccaactg 30900 
tgccttttct tactcctccc tttgtatccc ccaatgggtt tcaagagagt ccccctgggg 30960 
tactctcttt gcgcctatcc gaacctctag ttacctccaa tggcatgctt gcgctcaaaa 31020 
tgggcaacgg cctctctctg gacgaggccg gcaaccttac ctcccaaaat gtaaccactg 31080 
tgagcccacc tctcaaaaaa accaagtcaa acataaacct ggaaatatct gcacccctca 31140 
cagttacctc agaagcccta actgtggctg ccgccgcacc tctaatggtc gcgggcaaca 31200 
cactcaccat gcaatcacag gccccgctaa ccgtgcacga ctccaaactt agcattgcca 31260 
cccaaggacc cctcacagtg tcagaaggaa agctagccct gcaaacatca ggccccctca 31320 
ccaccaccga tagcagtacc cttactatca ctgcctcacc ccctctaact actgccactg 31380 
gtagcttggg cattgacttg aaagagccca tttatacaca aaatggaaaa ctaggactaa 31440 
agtacggggc tcctttgcat gtaacagacg acctaaacac tttgaccgta gcaactggtc 31500 
caggtgtgac tattaataat acttccttgc aaactaaagt tactggagcc ttgggttttg 31560 
attcacaagg caatatgcaa cttaatgtag caggaggact aaggattgat tctcaaaaca 31620 
gacgccttat acttgatgtt agttatccgt ttgatgctca aaaccaacta aatctaagac 31680 
taggacaggg ccctcttttt ataaactcag cccacaactt ggatattaac tacaacaaag 31740 
gcctttactt gtttacagct tcaaacaatt ccaaaaagct tgaggttaac ctaagcactg 31800 
ccaaggggtt gatgtttgac gctacagcca tagccattaa tgcaggagat gggcttgaat 31860 
ttggttcacc taatgcacca aacacaaatc ccctcaaaac aaaaattggc catggcctag 31920 
aatttgattc aaacaaggct atggttccta aactaggaac tggccttagt tttgacagca 31980 



WO 01/04282 



74 



PCI7US00/18971 



caggtgccat tacagtagga aacaaaaata 
ctccatctcc taactgtaga ctaaatgcag 
caaaatgtgg cagtcaaata cttgctacag 
ctccaatatc tggaacagtt caaagtgctc 
tgctactaaa caattccttc ctggacccag 
ctgaaggcac agcctataca aacgctgttg 
aatctcacgg taaaactgcc aaaagtaaca 
aaactaaacc tgtaacacta accattacac 
ctccaagtgc atactctatg tcattttcat 
aaatatttgc cacatcctct tacacttttt 
gtgttatgtt tcaacgtgtt tatttttcaa 
agtagtatag ccccaccacc acatagctta 
agaaccctag tattcaacct gccacctccc 
ccccggctgg ccttaaaaag catcatatca 
ttccacacgg tttcctgtcg agccaaacgc 
agctcactta agttcatgtc gctgtccagc 
ggttgcttaa cgggcggcga aggagaagtc 
tgcatcagga tagggcggtg gtgctgcagc 
tccgtcctgc aggaatacaa catggcagtg 
agcataaggc gccttgtcct ccgggcacag 
cagtaactgc agcacagcac cacaatattg 
ccaaagctca tggcggggac cacagaaccc 
attaagtggc gacccctcat aaacacgctg 
taattcacca cctcccggta ccatataaac 
atcctaaacc agctggccaa aacctgcccg 
gaacaatgac agtggagagc ccaggactcg 
tcaatgttgg cacaacacag gcacacgtgc 
cgcgttagaa ccatatccca gggaacaacc 
cagggaagac ctcgcacgta actcacgttg 
agcggatgat cctccagtat ggtagcgcgg 
ctactgtacg gagtgcgccg agacaaccga 
ggaacgccgg acgtagtcat atttcctgaa 
tctgcgtctc cggtctcgcc gcttagatcg 
tctcaaagca tccaggcgcc ccctggcttc 
tgccctgata acatccacca ccgcagaata 
ctgcgagtca cacacgggag gagcgggaag 
ccaaaagatt atccaaaacc tcaaaatgaa 
tggcgtggtc aaactctaca gccaaagaac 
tggcttccaa aaggcaaacg gccctcacgt 
ggtgaatctc ctctataaac attccagcac 
gccaccttct caatatatct ctaagcaaat 
tctgctccag agcgccctcc accttcagcc 
aggttcctca cagacctgta taagattcaa 
ccgtaggtcc cttcgcaggg ccagctgaac 
ggccacttcc ccgccaggaa ccttgacaaa 
cggagctatg ctaaccagcg tagccccgat 
aatgcaaggt. gctgctcaaa aaatcaggca 
agtcatgctc atgcagataa aggcaggtaa 
tttttctctc aaacatgtct gcgggtttct 
catttaaaca ttagaagcct gtcttacaac 
gactacggcc atgccggcgt gaccgtaaaa 
cgacagctcc tcggtcatgt ccggagtcat 
attcatcggt cagtgctaaa aagcgaccga 
gtagagacaa cattacagcc cccataggag 
cataaacacc tgaaaaaccc tcctgcctag 
catacagcgc ttcacagcgg cagcctaaca 
ttaaaaaaac accactcgac acggcaccag 
agtgcgttac actgcagcag gtgtgactca 
gcttggggca tggcccctta tagctgggcg 
acctcagtgt ttgtctttgc tctgaagagc 
gcaggaacac tcctgcctgc cttaccacct 
ttgccccctg cccagactcc catgttcctg 
caagcctcca tacctggtcc cacctctcca 
gggcatctgg ttgggggcag cctgggtgtt 
gccccctcta ctcttgagca atgctcttga 
aagccctgga agggcagacc caggacactc 



atgataagct aactttgtgg accacaccag 32040 
agaaagatgc taaactcact ttggtcttaa 32100 
tttcagtttt ggctgttaaa ggcagtttgg 32160 
atcttattat aagatttgac gaaaatggag 32220 
aatattggaa ctttagaaat ggagatctta 32280 
gatttatgcc taacctatca gcttatccaa 32340 
ttgtcagtca agtttactta aacggagaca 32400 
taaacggtac acaggaaaca ggagacacaa 32460 
gggactggtc tggccacaac tacattaatg 32520 
catacattgc ccaagaataa agaatcgttt 32580 
ttgcagaaaa tttcaagtca tttttcattc 32640 
tacagatcac cgtaccttaa tcaaactcac 32700 
tcccaacaca cagagtacac agtcctttct 32760 
tgggtaacag acatattctt aggtgttata 32820 
tcatcagtga tattaataaa ctccccgggc 32880 
tgctgagcca caggctgctg tccaacttgc 32940 
cacgcctaca tgggggtaga gtcataatcg 33000 
agcgcgcgaa taaactgctg ccgccgccgc 33060 
gtctcctcag cgatgattcg caccgcccgc 33120 
cagcgcaccc tgatctcact taaatcagca 33180 
ttcaaaatcc cacagtgcaa ggcgctgtat 33240 
acgtggccat cataccacaa gcgcaggtag 33300 
gacataaaca ttacctcttt tggcatgttg 33360 
ctctgattaa acatggcgcc atccaccacc 33420 
ccggctatac actgcaggga accgggactg 33480 
taaccatgga tcatcatgct cgtcatgata 33540 
atacacttcc tcaggattac aagctcctcc 33600 
cattcctgaa tcagcgtaaa tcccacactg 33660 
tgcattgtca aagtgttaca ttcgggcagc 33720 
gtttctgtct caaaaggagg tagacgatcc 33780 
gatcgtgttg gtcgtagtgt catgccaaat 3384 0 
gcaaaaccag gtgcgggcgt gacaaacaga 33900 
ctctgtgtag tagttgtagt atatccactc 33960 
gggttctatg taaactcctt catgcgccgc 34020 
agccacaccc agccaaccta cacattcgtt 34080 
agctggaaga accatgtttt tttttttatt 34140 
gatctattaa gtgaacgcgc tcccctccgg 34200 
agataatggc atttgtaaga tgttgcacaa 34260 
ccaagtggac gtaaaggcta aacccttcag 34320 
cttcaaccat gcccaaataa ttctcatctc 34380 
cccgaatatt aagtccggcc attgtaaaaa 34440 
tcaagcagcg aatcatgatt gcaaaaattc 34500 
aagcggaaca ttaacaaaaa taccgcgatc 34560 
ataatcgtgc aggtctgcac ggaccagcgc 34 620 
agaacccaca ctgattatga cacgcatact 34 680 
gtaagctttg ttgcatgggc ggcgatataa 34740 
aagcctcgcg caaaaaagaa agcacatcgt 34800 
gctccggaac caccacagaa aaagacacca 34860 
gcataaacac aaaataaaat aacaaaaaaa 34920 
aggaaaaaca acccttataa gcataagacg 34980 
aaactggtca ccgtgattaa aaagcaccac 35040 
aatgtaagac tcggtaaaca catcaggttg 35100 
aatagcccgg gggaatacat acccgcaggc 35160 
gtataacaaa attaatagga gagaaaaaca 35220 
gcaaaatagc accctcccgc tccagaacaa 35280 
gtcagcctta ccagtaaaaa agaaaaccta 35340 
ctcaatcagt cacagtgtaa aaaagggcca 35400 
gccatggcac ctctgcagcc tgggtaccct 35460 
gggcgtgggg gctctgtagg agtggcagcg 35520 
cctccaggtg cttgatccca ccttttccca 35580 
gtcctggctg atggcctgtt cctgcctcct 35640 
gacttgtggc ttcctccaac caggggctct 35700 
ggccgtggga gggaggttga ggagggtgga 35760 
cccctcccat cccctccctg ggcctcccag 35820 
gagcttcctg cctggctctt aacccagggc 35880 
tcaccacctc cttacctttt cccctggaaa 35940 
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aatcttctgt atacttccca ttttaagaaa actacaattc ccaacacata caagttactc 36000 
cgccctaaaa cctacgtcac ccgccccgtt cccacgcccc gcgccacgtc acaaactcca 36060 
ccccctcatt atcatattgg cttcaatcca aaataaggta tattattgat gatg 36114 

<210> 17 
<2H> 40 
<212> PRT 

<213> Adenovirus subgroup C 
<400> 17 

Met Thr Gly Ser Thr lie Ala Pro Thr Thr Asp Tyr Arg Asn Thr Thr 
15 10 15 

Ala Thr Gly Leu Thr Ser Ala Leu Asn Leu Pro Gin Val His Ala Phe 
20 25 30 

Val Asn Asp Trp Ala Ser Leu Asp 
35 40 



<210> 18 
<211> 19 
<212> PRT 

<213> Adenovirus subgroup C 
<400> 18 

Met Trp Trp Phe Ser lie Ala Leu Met Phe Val Cys Leu He He Met 
1 5 10 ' 15 

Trp Leu He 



<210> 19 
<211> 8 
<212> PRT 

<213> Adenovirus subgroup C 
<400> 19 

Lys Arg Arg Arg Ala Arg Pro Pro 
1 5 



<210> 20 
<211> 42 
<212> PRT 

<213> Adenovirus subgroup C 
<400> 20 

Cys Cys Leu Lys Arg Arg Arg Ala Arg Pro Pro He Tyr Arg Pro He 
15 10 15 

He Val Leu Asn Pro His Asn Glu Lys He His Arg Leu Asp Gly Leu 
20 25 30 

Lys Pro Cys Ser Leu Leu Leu Gin Tyr Asp 
35 40 
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